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"^The  second  part  is  concerned  with  signal  detection  in  bounded 
noise.  Both  the  known  signal  case  and  the  unknown  signal  case 
are  studied.  An  algorithm  of  a  sequential  nature  is  devised. 

Chapters  2  and  3  are  devoted  to  the  first  part  of  the  work. 
To  begin,  an  approximate  locally  optimal  scheme,  which  can  be 
considered  as  a  modified  linear  detector,  is  discussed.  This 
scheme  evolves  from  a  use  of  the  Edgeworth  series  as  a  repre¬ 
sentation  for  the  noise  density  function.  Convergence  of  the 
approximate  locally  optimal  statistic  to  the  parent  statistic 
is  also  discussed.  Furthermore,  the  Cornish-Fisher  inverse 
expansion  is  employed  to  investigate  the  effect  of  noise 
skewness  on  the  sample  mean  detector  and  the  sign  detector.  An 
asymptotic  relation  between  the  performance  and  the  skewness 
measure  is  given.  The  sign  detector  is  shown  to  be  less 
sensitive  than  the  sample  mean  detector  to  the  noise  skewness. 

A  modified  sample  mean  detector  is  devised. 

Chapters  4  and  5  address  signal  detection  in  bounded  noise. 
When  both  the  signal  and  the  bound  on  the  noise  magnitude  are 
known  constants,  the  sequential  procedure  will  lead  to  a 
singular  solution,  if  it  terminates.  A  more  practical  way  to 
consider  this  problem  is  to  assume  a  random  bound.  Then  a  test 
which  involves  randomness  in  its  performance  measures  (the 
false-alarm  rate,  the  power  and  the  error  probability)  will  be 
encountered.  When  the  signal  is  unknown,  the  problem  is  cast 
as  an  estimation  problem.  The  estimate  of  the  signal  strength 
is  used  to  decide  whether  the  signal  is  present.  A  generaliza¬ 
tion  of  the  problem  is  considered  in  Chapter  5  where  a  set- 
theoretic  formulation  is  used. 
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the  signal  is  unknown,  the  problem  is  cast  as  an  estimation  prob¬ 
lem.  The  estimate  of  the  signal  strength  is  used  to  decide 
whether  the  signal  is  present.  A  generalization  of  this  problem 
is  considered  in  Chapter  5  where  a  set-theoretic  formulation  is 
used. 


TABLE  OF  CONTENTS 


Chapter  1  -  Introduction 


Motivation .  1 

Report  Outline .  2 

References .  5 

Chapter  2  -  On  The  Application  of  Series  Expansions  to  Detection  Problems 

Introduction .  8 

Tchebycheff-Hermite  Polynomials  and  Gram-Charlier  Series .  8 

The  Edgeworth  Series  .  11 

Edgeworth  Series  Expansion  for  Locally  Optimum  Statistics  .  14 

Cornish-Fisher  Expansion  for  Test  Statistics .  18 

Conclusions .  20 

References .  30 

Appendix  2.A  .  32 

Appendix  2.B .  33 

Chapter  3  -  Signal  Detection  In  Nearly  Gaussian  Skewed  Noise 

Introduction .  37 

Preliminary  Results  .  39 

The  Sample  Mean  Detector  .  44 

The  Sign  Detector  .  47 

A  Modified  Sample  Mean  Detector .  53 

Simulation  Examples  .  55 

A  Case  Study  Using  Under-Ice  Ambient  Noise .  57 

Conclusions .  66 

References .  87 

Chapter  4  -  Detection  of  Sure  Signals  in  Noise  of  Bounded  Magnitude 

Introduction .  69 

Problem  Formulation .  71 

Constant  Known  Signals  in  Noise  of  Random  Bounds .  75 

Duality  Between  Hypothesis  Testings  And  Confidence  Regions .  94 

Detection  of  Unknown  Constant  Signals  in  Bounded  Noise  . :»  99 

Conclusions . 102 

References . , . 103 


Chapter  5  -  A  Set-Theoretic  Approach  to  Detection  Problems 


Introduction . 105 

A  Detection  Problem . 106 

Detection  of  Unknown  Vector-Signal  in  M-Variate  Bounded  Noise  108 

Performance  Evaluation  . 112 

Conclusions . 113 

References . 115 

Chapter  6  -  Conclusions 

Summary  of  Results . 116 

Further  Research . 117 


CHAPTER! 


INTRODUCTION 

1.1  Motivation 

Extraction  of  signals  from  a  noisy  environment  has  been  an  impor¬ 
tant  and  much  studied  problem  in  statistical  communication  theory.  In 
particular,  the  detection  problem  of  deciding  whether  or  not  a  signal  is 
present  by  taking  a  sequence  of  noise  contaminated  observations  has 
been  of  much  interest  to  communication  scientists  and  engineers.  Vari¬ 
ous  detection  schemes  have  been  discussed  in  a  broad  body  of  literature. 
In  classical  detection  theory,  parametric  detection  was  of  much  concern 
due  to  the  well  developed  theory  of  parametric  statistical  inference  [1]. 
The  Neyman-Pearson  optimal  detector  and  the  locally  optimal  detector 
are  two  canonical  examples.  However,  implementation  of  these 
(parametric)  optimal  schemes  usually  requires  a  fairly  complete 
knowledge  of  the  input  model.  In  practice,  this  knowledge  may  not  be 
available.  Also,  the  performance  may  be  sensitive  to  the  inaccuracy  of 
the  presumed  knowledge.  Some  attention  thus  has  to  be  diverted  to 
other  alternatives. 

Data  from  a  number  of  natural  environments  show  that  noise  is  often 
non-Gaussian  [2].  Nevertheless,  due  to  the  Central  Limit  Theorem.  Gaus¬ 
sian  noise  is  of  much  interest  and,  in  fact,  is  the  one  for  which  the  sim¬ 
plest  solution  can  be  obtained.  In  this  case,  it  is  well  known  that  the 
sample  mean  detector  is  the  Neyman-Pearson  optimal  detector  [3]. 
Thus,  in  most  practical  situations,  modifications  of  the  sample  mean 
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deiecior  may  yield  a  more  satisfactory  detector  performance.  In  other 
words,  these  modifications  could  have  "reasonably  good"  (or  nearly 
optimal  in  some  sense)  performance  for  the  presumed  input  model  and 
could  be  less  sensitive  to  model  deviations.  This  motivates  the  first  part 
of  the  report. 

Motivation  for  the  second  part  is  due  to  the  fact  that  a  precise  sta¬ 
tistical  model  of  the  noise  is  difficult  to  obtain  in  many  practical  situa¬ 
tions.  Bounds  on  the  magnitude  of  the  noise,  in  contrast,  may  be  more 
easily  obtained.  One  may  even  argue  that,  in  the  real  world,  there  exist 
no  distributions  with  infinite  support.  Hence  considerations  of  bounded 
noise  should  lead  to  more  practical  solutions.  As  a  matter  of  fact,  in  con¬ 
trol  theory,  bounded  noise  has  been  exploited  in  the  literature  for  consid¬ 
ering  estimation  problems  [4],  [5],  [6]. 

1.2  Report  Outline 

Throughout  this  report  the  noise  processes  are  assumed  to  be 
discrete-time  processes  which  may  be  obtained,  for  example,  from  the 
sampling  of  continuous-time  processes.  In  addition,  the  detection  prob¬ 
lems  considered  here  are  detection  of  constant  signals  in  additive  noise. 
It  should  also  be  noted  that  all  the  chapters  are  complete  by  themselves 
and  can  be  read  in  any  order.  The  report  is  organized  as  follows: 

Chapter  2  considers  series  expansions  for  the  test  statistics  of  two 
optimal  detection  schemes,  the  locally  optimal  statistic  and  the  sample- 
mean  statistic  which  is  Neyman-Pearson  optimal  for  the  Gaussian  case. 
For  the  former,  the  Edgeworth  series  is  employed  to  obtain  a  general 
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representation  of  the  noise  density  function.  The  locally  optimal  statistic 
is  then  evaluated  via  this  representation.  Since  only  a  few  lower  order 
moments  of  the  noise  are  required,  the  solutions  are  applicable  to  a  wider 
class  of  noise  distributions.  The  second  part  of  this  chapter  utilizes  the 
Comish-Fisher  inverse  expansion  for  a  sequence  of  i.i.d.  random  vari¬ 
ables.  The  sensitivity  of  the  sample  mean  detector  to  the  underlying 
noise  skewness  can  then  be  studied.  An  asymptotic  relation  between  the 
performance  (false-alarm,  power,  and  probability  of  error)  and  the  noise 
skewness  measure  is  obtained. 

The  Cornish-Fisher  expansion  is  used  again  in  Chapter  3  for  further 
investigation  of  the  effect  of  noise  skewness  on  detector  performance.  A 
modified  sample  mean  detector  is  proposed  whose  performance  is  asymp¬ 
totically  indifferent  to  noise  skewness.  Simulation  results  are  provided  to 
verify  the  analysis.  A  preliminary  case  study  of  a  natural  noise  environ¬ 
ment  is  also  given  here. 

The  latter  part  of  this  report  discusses  the  detection  problem 
from  a  rather  different  point  of  view.  Instead  of  noise  statistics,  bounds 
for  the  noise  magnitude  are  assumed  to  be  known  in  Chapter  4.  To  make 
the  problem  more  practical,  the  bound  is  assumed  to  be  a  random  vari¬ 
able  with  some  known  distribution.  Then  a  sequential  procedure  is  dis¬ 
cussed.  Detection  of  an  unknown  signal  in  bounded  noise  is  also 
addressed  in  this  chapter.  The  problem  is  cast  as  an  estimation  problem 
for  the  signal. 

Chapter  5  is  essentially  concerned  with  a  generalized  version  of  the 
problem  discussed  in  the  second  part  of  Chapter  4.  A  set-theoretic  for- 


mulation  of  the  problem  is  used  here  to  consider  a  "multi-channel"  detec¬ 
tion  problem.  The  results  presented  in  this  chapter  are  somewhat  prel¬ 
iminary  due  to  the  difficulty  involved  in  performance  evaluation.  Chapter 
6  concludes  this  report  and  discusses  some  possibilities  for  future 


References 


[1]  H.L  Van  Trees,  Detection,  Estimation,  and  Modulation  Theory,  Part  I. 
New  York:  John  Wiley,  1968. 

[2]  R.F.  Dwyer,  "Fram  II  Single  Channel  Ambient  Noise  Statistics," 

101st  Meeting  of  The  Acoustical  Society  of  America,  May,  1981. 

[3]  J.  Marcum,  "Statistical  Theory  of  Target  Detection  by  Puls  !7adar," 
IRE  Trans.  Information  Theory,  vol.  IT-6,  pp.  59-267,  19 

[4]  F.C.  Schweppe,  "Recursive  State  Estimation:  Unknown  t  Bounded 
Errors  and  System  Inputs,"  IEEE  Trans.  Automat.  Con  rol.  AC- 
13,  pp. 22-28,  1968. 

[5]  D.P.  Bertsekas  and  I.B.  Rhodes,  "Recursive  State  Estimation  for  a 
Set-Membership  Description  of  Uncertainty,"  IEEE  Trans,  on 
Automat.  Contr.,  Vol.  AC-16,  pp.  117-128,  1971. 

[6]  M.  Milanese  and  G.  Belforte,  ''Estimation  Theory  and  Uncertainty 
Intervals  Evaluation  in  Presence  of  Unknown  But  Bounded  Errors: 
Linear  Families  of  Models  and  Estimators,"  IEEE  Trans,  on  Automat. 
Contr.,  Vol.  AC-27,  pp.  408-414,  1982. 


CHAPTER  2 


ON  THE  APPLICATION  OF  SERIES  EXPANSIONS  TO  DETECTION  PROBLEMS 

2.1  Introduction 

The  problem  of  using  series  expansions  to  represent  an  unknown 
function  in  terms  of  a  known  function  has  been  of  much  interest  to  scien¬ 
tists  and  engineers.  In  the  literature  of  statistics,  seeking  for  a  general 
representation  of  probability  distribution  functions  or  density  functions 
has  been  an  extensively  studied  problem,  [1]  -  [4].  A  representation 
based  on  series  expansions  in  terms  of  the  moments  is  particularly 
relevant  when  sequences  of  independent  and  identically  distributed 
(i.i.d.)  random  variables  are  considered.  One  of  *he  most  common  series 
expansions  used  to  represent  an  unknown  probability  density  function  by 
a  known  density  function  is  the  Gram-Charlier  series.  The  Gram-Charlier 
series  is  an  expansion  in  a  series  of  orthogonal  polynomials,  i.e.,  the 
Tchebycheff-Hermite  polynomials,  which  are  derived  from  the  normal 
density  function.  Unfortunately,  straightforward  applications  of  the 
Gram-Charlier  series  do  not  usually  lead  to  satisfactory  solutions.  It  is 
well  known  in  the  literature  that  regrouping  of  the  Gram-Charlier  series, 
e.g.  the  Edgeworth  series  [5], [6],  will  yield  better  results;  although  gen¬ 
eralization  of  the  Gram-Charlier  series  is  also  a  possible  way  for  irnprov- 
ing  the  result  [7]. 

In  the  context  of  statistical  communicatioits,  the  method  of  series 
expansions  has  also  been  used  to  evaluate  approximately  the  probability 
of  errors  in  radar  detection  [8],  and  to  estimate  error  probabilities  in 
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digital  communication  systems  [9]  -  [12].  This  method  is  employed  in 
this  chapter  to  consider  detection  problems,  although  for  somewhat 
different  usages.  To  begin  with,  it  is  used  here  to  obtain  a  general 
representation  for  the  noise  density  function.  Then  the  locally  optimum 
detection  scheme  is  devised  based  on  this  representation.  The  essential 
purpose  of  this  approach  is  to  reduce  the  effect  of  small  model  deviations 
on  the  locally  optimal  detectors.  This  is  because  series  representation 
only  requires  a  knowledge  of  the  moments  of  the  noise  distribution.  Thus 
when  a  finite  number  of  terms  of  the  series  are  used,  the  result  can  be 
expected  to  hold  approximately  for  a  certain  family  of  noise  distribu¬ 
tions. 

The  other  utility  of  the  series  expansions  in  this  chapter  is  the  appli¬ 
cation  of  the  Cornish-Fisher  inverse  expansion  [13]  for  a  random  variable 
in  terms  of  its  moments.  This  expansion  is  used  to  obtain  a  representa¬ 
tion  for  a  test  statistic,  the  sample  mean  of  a  sequence  of  i.i.d.  random 
variables.  In  this  way,  a  study  relating  the  performance  of  the  sample 
mean  detector  to  the  underlying  noise  skewness  can  be  facilitated. 

This  chapter  is  organized  as  follows  :  the  next  section  discusses 
derivations  of  the  TchebychefT-Hermite  polynomials  and  the  Gram- 
Charlier  series.  Section  2.3  addresses  some  difficulties  associated  with 
the  Gram-Charlier  series  and  presents  a  regrouped  version  of  the  Gram- 
Charlier  series,  namely,  the  Edgeworth  series.  Section  2.4  considers  a 
detection  scheme  using  an  approximate  locally  optimal  statistic  which  is 
obtained  by  using  the  Edgeworth  series  representation  for  the  noise  den¬ 
sity  function.  Section  2.5  utilizes  the  Cornish-Fisher  inversion  series  for 
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the  sample  mean  of  a  sequence  of  i.i.d.  random  variables  to  study  the 
effect  of  noise  skewness  on  the  sample  mean  detector.  Section  2.6  con¬ 
cludes  this  chapter. 

2.2  Tchebycheff-Hennite  Polynomials  and  Gram-Charlies  Series 

Derivations  of  the  Gram-Charlier  series  can  be  found  in  many  stan¬ 
dard  statistics  texts  (see  e.g.  [5], [6]).  For  completeness  of  discussion, 
one  which  follows  mostly  from  [6]  is  given  here.  The  derivation  will  start 
with  the  definition  of  the  Tchebycheff-Hermite  polynomials.  Consider  a 
Gaussian  density  function  /o(*)  given  by 

-  vs (1) 

Taking  successive  derivatives  of  /<>(*)  with  respect  to  x  yields 

f'o  (*)  =  -  ^-/0(x) 
f"  o(*)  =  (fr-  0/o(*) 

/o(4)(x)  =  (  j-J--  6^-+  3)/o(x) 

/o(8)(x)  =  (  -^-+  10^1-  15^/o(x) 
etc.  The  Tchebycheff-Hermite  polynomial  is  defined  by  the  following 
equation 


ffo(*)  =  1 

*,<*)  =  f- 


Thus 


/  o(n)(*)  =  (-l)ncr“n//n(*)/o(x) 
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Hz(x)  =  1 

»«(*)  =  *r-  6^-+  3 
«5(x)  =  £-  io£+  wt 
and  so  on.  Now,  from  Eq.  (1), 


tx_  t» 

/ o(*  ” 0  =  /<>(*)  ®  ^ 

then,  by  Taylor's  series  expansion,  we  have 


/o(i-0  =  2  I^Vi,)(*) 

t«0 

=  £  4r^<(*)/o(*) 

<*ol! 

Thus,  a  general  formula  for  the  TchebychefT-Hermite  polynomials  is 


«*W-5r- 


n(2]  xn-2 


nW  in_4 


n 


[8]  _n-6 


2-1!  a*1-2  2^>2!  o*-4  23-3!  a*1"8 


(2) 


where 


n.Irl  ^n(n-l)(n— 2).....(n-r  +  l)  r’st  n, 

Apparently,  the  polynomials  are  of  degree  n  in  *  with  the  coefficient  of 

xn  being  unity.  Also,  they  are  orthogonal  over  the  interval  (— 00 ,  *»)  with 

respect  to  the  weighting  function  /  0(x),  namely 

fj0(x)Hn(x)Hn(z)<ix  =  {mn\  (3) 

where 


if  m*n 
if  m=n 


is  the  Kronecker  delta  function.  Furthermore,  successive  Tchebycheff- 
Hermite  polynomials  satisfy  the  following  relations  : 


rn(*)  =  T*»-,(*) 


Hn(x)  -  JtfB_,(x)  +  (n-l)//n_2(x)  =  0  (5) 

Now,  the  probability  density  function  /  (x)  of  a  random  variable  (with 
zero  mean  and  known  variance  a2)  can  be  represented  in  terms  of  the 
TchebychefT-Hermite  polynomials,  i.  e. 

/(*)  =  £  CiHi(x)f0(x)  (6) 

<■0 

This  is  the  so-called  Gram-Charlier  series.  Due  to  the  orthogonal  pro¬ 
perty  of  the  polynomials,  Eq.  (3),  the  coefficients  cr  in  Eq.  (6)  can  be 
determined  by  multiplying  both  sides  of  Eq.  (6)  by  Hr(x)  and  integrating 
the  products  from  -«  to  ».  Hence 


cr  =  -f_J(x)Hr(x)dx 


Thus 


c0=  1 

c  i  =  0 
c2  =  0 


_ 


etc.,  where  the  m^'s  are  the  i  -th  central  moments  of  /(x).  In  some 
situations,  the  coefficients  cr  may  be  better  represented  in  terms  of  the 
cumulants.  Thus, 


r  -  *8 

*“  120#* 

etc..  Then  the  series  expansion  for  /  (x)  is  given  by 

/  (*)  =  /o(*)[  1  +  ♦  |j<^r-  3)  »«(*)  + . ] 

«  /«(*)[  i  +  * 

W^*9(l)  + . '  (B* 

It  is  known  that  a  sufficient  condition  for  this  series  to  be  convergent  (at 
every  continuity  point  of  /  (*))  is  [5] 

f  e*,/4/  (x)dx  <  «  (10) 

Obviously,  an  infinite  series  is  somewhat  impractical.  It  has  to  be  trun¬ 
cated  at  some  point.  However,  truncation  of  the  series  will  result  in  some 
difficulties.  The  series  may  not  converge  fast  enough  so  that  the  trun¬ 
cated  series  will  be  a  satisfactory  representation  of  /(*).  Furthermore, 
the  order  of  magnitude  of  successive  terms  of  the  series  is  not  neces¬ 
sarily  monotone  decreasing.  These  problems  will  be  discussed  in  more 
detail  in  the  next  section. 

2.3  The  Edgeworth  Series 

As  discussed  in  the  previous  section,  the  Gram-Charlier  series,  Eq. 
(9),  does  not  necessarily  provide  a  good  approximation  to  the  probability 
density  function.  Generally  speaking,  a  truncated  version  of  the  Gram- 
Charlier  series  can  be  a  good  approximation  only  when  the  parent  proba¬ 
bility  density  function  is  nearly  Gaussian.  A  well  known  problem  is  that 
the  order  of  magnitude  of  the  terms  in  the  Gram-Charlier  series  is  not 
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steadily  decreasing.  To  be  more  specific,  suppose,  for  example,  that  one 
wants  to  represent  the  probability  density  function  of  the  sample  mean  of 
a  sequence  of  i.i.d.  random  variables  in  terms  of  the  Gaussian  den¬ 
sity  function.  Without  loss  of  generality,  it  is  assumed  here  that  the  dis¬ 
tribution  of  [Xi  \  has  zero  mean  and  unity  variance.  Now,  let 


Sm  ■  jfi  Xi 


From  Eq.  (9),  the  p.  d.  f.  of  Sy  can  be  represented  as  follows 


fs(s)  =/0(s)  [  1  +  ±-K'aH3(s)  +  JL/c'4/f4(s)  + 

+  720  {K'*  +  + . 3  (11) 

where 


_  «3 

3“  VM 

_  5i_ 

4"  M 

s  -  *s 
8“  AT3'2 

,  _  *6 


*  8  = 


Af2 


and  the  k\  are  the  cumulants  of  Sy  while  the  x*  are  the  cumulants  of  Xt. 
In  fact,  if  Eq.  (11)  is  to  be  written  as  follows 


/s(s)  =  £  / o(s)cr  Hr(s) 

r-0 


(12) 


then  a  general  formula  for  cr  is  [5] 


_  o,,Af  +  ar8»1!+  •  •• 

r - TTir7*  <13) 

where  [r/3]  denotes  the  greatest  integer  sr/3,  and  the  a*  are  polyno¬ 
mials  in  K,..  The  a*  are  independent  of  M.  Also,  as  Af  tends  to  infinity, 


cr  =  0  (  )  (14) 

Thus,  the  order  of  magnitude  of  the  terms  of  the  Gram-Char lier 
series  is  clearly  not  steadily  decreasing  as  r  increases.  The  following 
table  illustrates  the  order  of  magnitude  in  terms  of  powers  of  M  that  the 


coefficients  cr  involve. 


Order  of  cr 

AT 1/2 
AT1 
ATVZ 
A/"8 
AT" 5/2 


3 

4,6 

5,7,0 

0,10,12 

11,13,15 


Now,  suppose  that  a  partial  sum  of  the  series,  Eq.  (12),  is  to  be  calculated 
such  that  all  terms  of  magnitude  order  less  than  Af-1  are  truncated. 
Then,  it  can  be  seen  from  the  table  that  all  terms  up  to  r  =6  should  be 
considered.  Thus,  the  moments  up  to  sixth  order  will  be  needed.  How¬ 
ever.  a  careful  examination  of  Eqs.  (12)  and  (13)  reveals  that  the 
moments  of  order  higher  than  the  fourth  really  do  not  involve  terms  of 
magnitude  order  greater  than  M~l.  Therefore,  the  requirement  of  the 
moments  up  to  sixth  order  is  unnecessary  if  all  terms  of  Order  less  than 
AT*1  are  to  be  neglected.  Similarly,  if  one  proceeds  further  to  include 
terms  containing  the  factors  A/-vz,  A/-2,  etc.,  the  same  redundancy  will 
be  encountered. 

From  the  above  discussion,  it  is  clear  that  a  regrouping  of  terms  in 
the  Gram-Charlier  series  will  improve  the  efficiency  of  the  approximation. 
The  Edgeworth  series,  which  is  a  regrouped  version  of  the  Gram-Charlier 
series,  actually  provides  more  satisfactory  solutions.  A  general  form  of 


the  Edgeworth  series  expansion  for  an  unknown  density  function  in  terms 
of  its  moments  and  the  normal  density  function  is  given  in  the  following 

/(*)  =  /o(*)  I  1  +  cs#s(*)  +  c4H4(x)  +  ce#e(*)  + 

C  $H  g(z  )  +  CyHy(x)  +  CgHg(x)  + . ]  (15) 

where  the  c/s  are  given  in  Eq.(8).  As  can  be  seen  from  Eq.  (15),  the 

order  of  terms  in  the  Edgeworth  series  follows  directly  those  given  in  the 
table  of  the  last  page.  One  can  see  that  this  series  gives  a  straightfor¬ 
ward  expansion  in  powers  of  M~x/*  .  Also,  calculation  of  the  terms  in  the 
Edgeworth  series  up  to  a  certain  order  of  magnitude  does  not  require  a 
knowledge  of  any  moments  or  cumulants  that  are  not  really  necessary. 
Thus  the  redundancy  involved  in  the  G ram-Char lier  series  discussed  pre¬ 
viously  is  eliminated.  Furthermore,  terms  of  the  Edgeworth  series  should 
be  taken  by  groups  of  the  same  magnitude  order;  thus  any  partial  sum  ol 
this  series  is  an  asymptotic  expansion  of  the  parent  density  function  in 
powers  of  M~wz,  with  a  remainder  term  of  the  same  order  as  the  first 
term  neglected.  More  complete  discussions  on  the  derivation  of  the 
Edgeworth  series  can  be  found  in  [5],[6]. 

2.4  Edgeworth  Series  Expansion  far  Locally  Optimum  Statistics 

In  detection  problems,  optimal  detection  schemes  usually  require  a 
large  amount  of  knowledge  concerning  the  noise  environment.  Also,  it  is 
not  unusual  that  these  optimal  schemes  are  complicated,  and  not  easy  to 
implement.  Furthermore,  the  performance  may  be  sensitive  to  the  accu¬ 
racy  of  the  presumed  noise  statistics.  Thus  alternatives  that  involve  less 
complexity  and  less  sensitivity  are  always  desirable.  The  locally  optimal 


detectors  are  known  to  have  maximum  rate  of  increase  in  the  probability 
of  detection  when  the  signal  strength  is  equal  to  zero.  Hence,  they  are 
useful  schemes  when  the  signal  strength  is  small.  However,  their  imple¬ 
mentation  require  a  complete  knowledge  of  the  underlying  noise  density 
function  and  the  test  statistics  become  complicated  or  even  intractable 
very  easily.  In  applications  where  the  noise  density  function  is  only 
known  to  be  nearly  Gaussian  (or,  specifically,  it  satisfies  Eq.  (10)),  the 
Edgeworth  series  expansion  discussed  in  the  previous  section  may  be 
used  as  a  general  representation  of  the  noise  density  function.  Then  the 
locally  optimum  test  statistics  can  be  obtained  based  on  this  representa¬ 
tion  of  the  density  function.  The  result  thus  obtained  is  expected  to  be 
less  sensitive  to  the  deviations  of  noise  statistics  as  only  the  lower-order 
moments  of  the  noise  distribution  are  required  to  obtain  this  representa¬ 
tion. 

Suppose  that  the  noise  distribution  has  zero  mean  and  unity  variance 
(a  generalization  of  the  following  discussion  with  assumption  of  an  arbi¬ 
trary  variance  is  trivial).  Now  consider  a  series  representation,  y  (x),  up 
to  a  second  order  approximation  of  the  noise  density  function  /(x)  such 
that 

I  /  (x)  -  g(x)  |  <  for  almost  all  x  €  (-«,«)  , 

where  C  is  a  constant.  Then,  from  Eq.  (15),  the  noise  density  function  can 
be  represented  by 

f  (*)  =  /o(*)[  1  +  c s//s(*)  ♦  caha (*)  ♦  cetfe(*)  ]  (1®) 

where 
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is  the  standard  normal  density  function.  Now  the  derivative  of  g(x)  is 
equal  to 

g'(*)  *  /  ' o(*)  [  1  +  C3/f3(*)  +  c4//4(x)  +  ce//6(z)  ]  +  .(1?) 

/o(*)  [  3c s H2(x)  +  4c  ^(z)  +  6 CtHs(x)  ] 

If  it  is  further  assumed  that  the  noise  density  function  is  symmetric  with 

respect  to  the  origin,  then  c3  =  0  and  Eqs.  (16)  and  (17)  become 

g( *)  =  /o(*)  [  1  +  c4^4(*)  +  c6#e(*)  ]  (18) 

and 


g'(*)  =  /'<>(*)  [  1  +  c4tf4(z)  +  ce//6(z)  ]  +  (19) 

J0(x)  [4c4tf3(z)  +  6c*H8(z)  ] 

Now  the  locally  optimal  statistic  can  be  approximated  by 


Ttx)  =  -  g '(*)..=  *  _  Jc4//a(z)  46c^a(«)_| 
g(x)  1  +  c4W4(*)  +  C3«8(Z) 

The  second  term  in  Eq.  (20)  can  be  regarded  as  a  correction  term  due  to 


the  deviation  from  the  Gaussian  model.  The  performance  of  this  non¬ 


linear  detection  scheme  is  evaluated  here  via  the  asymptotic  relative 
efficiency  (  ARE  )  with  respect  to  the  linear  detector.  The  ARE  of  this 


approximated  locally  optimal  detector  comparing  to  the  linear  detector 
is.  [14], 


if  <^(x))/(x)dx] 

ARE+tM  =  — =■ 

Since  T(x)  is  an  odd  function  in  x,  Eq.  (20).  and  it  is  assumed  that  /‘(x)  is 
symmetric  with  respect  to  the  origin,  thus  E[T(x)]  =  0.  .Furthermore,  if 
one  assumes  that 


then 


lim  xn  / (x)  =  0  for  any  integer  n. 


(22) 
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B 


arf  [/  r(»)/’(x)te]» 

AREatoi'U  -J  VarfrU)] 

On  the  other  hand,  it  can  be  shown  easily  that  the  ARE  of  the  locally 
optimal  detector,  as  compared  to  the  linear  detector  is  given  by 


MB***  =  f  1r^-dx  ■ 

if  Eq.  (22)  holds. 

From  Eq.  (20),  it  is  trivial  to  see  that 

or2  ^  Var[T(X)]  -  1  -  ZE[Xh(X)]  +  E[h?{X)] 


where 


h(Y v  _  4c 3(x )  +  6cg//8(z) 

'  "  1  +  caHAx)  +  c bHmIx') 


caH  4(*)  +  c«//8(x) 

By  the  Central  Limit  Theorem,  it  can  be  shown  that  (Appendix  2. A) 


therefore, 


Note  that 


AREqimim  ~  z 

jt 


ARE  >  1  oT  <  1  **  E[K\X )]  <  ZE[Xh(X)] 


(23) 


and 


E  (fc*(J0|  =  f  h\x)J  (i)<fa  =  2  /  h*(*)/  (i)dx 
—  0 


E\Xh(X)\  =  f  zA(x)/(z)dx  =  2  f  xh(x)f  (x)dx 

—  o 

also,  /(z)  is  a  p.d.f.  and  is  thus  non-negative  for  z  e  (-»,«»).  Hence  a 
sufficient  condition  for  the  AAE’om.m  t°  be  greater  than  unity  is 

2zA(z)  >  A2(z)  V*ze(0,  «*>)  (24) 

However,  it  is  not  necessarily  true  that  Eq.  (24)  holds.  Detailed  discus¬ 
sions  on  Eq.  (24)  are  given  in  Appendix  2.B.  It  can  be  expected  .neverthe¬ 
less,  that  when  the  noise  model  is  nearly  Gaussian,  the  AREalVft  M  is 


4 


J 


J 


greater  than  unity. 

Finally,  it  should  be  noted  that,  in  general,  it  is  not  necessarily  true 
that  £'(*)•  Eq.  (17),  will  converge  to  /'(*)  even  if  g(x)  converges  to  /(x). 
A  sufficient  condition  to  be  satisfied  is  the  Tauberian  condition  [15]  which 
requires  that/'(x)  be  monotone  nondecreasing.  Unfortunately,  for  some 
density  functions,  this  condition  may  not  be  satisfied.  However,  practi¬ 
cally  speaking,  convergence  is  an  asymptotic  property  which  does  not 
usually  provide  enough  information  as  to  how  well  the  series  expansion 
approximates  the  parent  function,  especially  when  only  a  finite  number 
(or  even  a  small  number)  of  terms  are  used.  In  other  words,  an  essential 
issue  is  whether  or  not  the  expansion  yields  a  good  approximation  to  the 
parent  function.  Moreover,  in  detection  problems,  it  is  of  more  concern 
that  the  resulting  detection  scheme  leads  to  a  good  performance  meas¬ 
ure,  e.g.  an  ARE  greater  than  unity.  Figs.  2.A  and  2.B  show  curves 
describing  the  T(x)  given  in  Eq.  (19),  together  with  the  test  statistics  of 
the  locally  optimum  and  the  linear  detectors  using  mixture  noise  models. 
These  models  are  Gaussian  slightly  contaminated  with  Laplace  noise.  The 
AREajbdM  f°r  these  two  examples  are  greater  than  one;  although  the  T{x) 
is  a  good  approximation  to  the  locally  optimal  statistic  only  in  the  neigh¬ 
borhood  of  the  origin.  This  result  provides  some  positive  illustration  of 
applications  of  the  method  discussed  here. 

2.5  Cornish-Fisher  Expansion  for  Test  Statistics 

In  this  section,  the  Cornish-Fisher  expansion  of  a  random  variable  in 
terms  of  its  moments,  [13],  will  be  employed  for  the  sample  mean  of  a 


Fig.  2.A  Locally  Optimal  Statistics  For  Mixture  Model 


/(*)=(! 


of 
e  =  0.05 


r2  _  _2_ 

9  “  TT 
cr 


-=0.8 


7 

AREa icd, id  ~  1-001 


sequence  of  i.i.d.  random  variables.  In  detection  problems,  this  sample 
mean  is  known  to  be  the  test  statistic  of  the  sample  mean  detector  which 
is  the  Neyman-Pearson  optimal  detector  when  the  underlying  noise  is 
Gaussian.  However,  when  the  noise  density  is  not  Gaussian  due  to  skew* 
ness,  the  performance  of  the  linear  detector  may  deteriorate.  Little 
work  has  been  done  on  relating  performance  of  the  sample  mean  detec¬ 
tor  to  noise  skewness,  partly  because  the  conventional  skewness  measure 
[6]  does  not  appear  explicitly  in  the  likelihood  ratio  or  the  test  statistics. 
The  objective  here  is  to  use  a  series  expansion  for  the  test  statistic  in 
terms  of  its  moments  such  that  an  explicit  relation  between  the  noise 
skewness  and  the  performance  of  the  detector  can  be  obtained.  In  order 
to  facilitate  the  study,  the  underlying  noise  will  be  assumed  to  be  only 
slightly  skewed  and  nearly  Gaussian.  This  assumption  enables  us  to  use 
the  first  few  terms  of  the  Cornish-  Fisher  expansion  and  obtain  a  good 
approximation  to  the  test  statistic. 

Now  the  following  canonical  binary  hypothesis  testing  problem  will  be 
considered: 

H0:Xi=Ni 

Hx:  Xt  =  Nt  +  s,  s> 0  i=1,2 . M 

It  is  assumed  here  that,  under  Hq,  the  p.  d.  f.  of  X *  is  /(x*)  with  zero 
mean,  known  variance  a2  and  known  third-order  central  moment  fjq, 
where  Ms  is  assumed  to  be  small.  Furthermore,  the  observation  sequence 
\Xi }  is  supposed  to  be  independent  and  identically  distributed.  Now,  con¬ 
sider  the  following  test  statistic 
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This  T(X)  can  be  written  in  terms  of  the  Comish-Fisher  expansion, 
namely, 

under  H0 :  T(X)  =  -^=-Z  +  g|^-(Z2  -  D  +  GfAT8'2) 
and  (25) 

under  Hx :  7U)  =  -^~Z  +  s  +  ^<Z2  -  D  + 
where  Z  is  a  standard  normal  random  variable  and  0(Af_3/2)  stands  for 
the  terms  which  converge  to  zero  as  fast  as  M~z/z  when  M  tends  to 
infinity. 

It  is  clear  that,  in  order  to  make  the  problem  tractable,  the  series  in 
Eq.  (25)  has  to  be  truncated.  Since  it  has  been  assumed  that  n$  is  small, 
the  truncated  series  will  be  a  good  approximation  to  T{X).  Now,  define 
two  random  sequences  f  y0(AO{  and  \Yi(M)\  by 


and 


WO  = 


H 3 

6c2  A/ 


(*2  -  1) 


(26) 


W=^Z+s  +  i^-(Z2-l) 

Then,  it  can  be  shown  that  [Y0(M)\  and  [Y\(M)\  converge  to  T(X)  almost 
surely  under  H0  and  Hx  respectively.  Hence  the  performance  of  the 
detector  which  uses  T{X)  as  the  test  statistic  can  be  evaluated  approxi¬ 
mately  via  the  statistics  of  [Yq(M)\  and  [Yx(li)\,  provided  that  M  is 
sufficiently  large. 

Let  and  B-a/  'ZM ,  then  Eq.  (26)  becomes 

Y0(M)  =  AZz  +  BZ  -  A  and  YX(M)  =  AZz  +  BZ  -A  +  s.  Note  that  Z  is  a 
standard  normal  random  variable.  Thus  the  probability  density  functions 
of  Yq  and  Yx  are  given  by 


fra(y) 


=  0  otherwise 


where 


Q(y)  =  V52  +  4i4z  +  4i4y 


/ y,  ( V )  =/y0(v-s) 


The  false-alarm  rate  a  is  given  by 

a  =  f  fy0(y) d-y 

*Q 

where  Tq  is  the  threshold.  From  Eq.  (28),  it  follows  that 


dQ(y)  -  2 A  Q~Hy)  dy 

Substituting  Eqs.  (27)  and  (30)  into  Eq.  (29)  yields 

-  -  Cg  + 

•C.) 


Thus 


where 


./  B  \x/  b  \x/ 


r*  £  ^(r0) 

Similarly,  the  power  0  is  given  by 

«5ghS, 

If  the  skewness  is  defined  as 


*«  * L 
f  o3 

then,  it  is  obvious  that  both  a  and  p  are  dependent  upon  the  noise 
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skewness,  via  the  variables 


£2— 


9(7q-s) 

24 


In  particular, 


Fig.  2.C  gives  a  set  of  curves  which  depicts  the  normalized  rate  of  change 
in  false-alarm  rate,  power,  and  probability  of  error  as  functions  of  the 
noise  skewness.  It  is  assumed  that  the  underlying  noise  density,  which  is 
described  by  Eq.  (27),  is  positively  skewed.  One  can  see  that  both  the 
false-alarm  rate  and  the  probability  of  error  are  monotone  increasing 
functions  of  the  noise  skewness  £.  On  the  other  hand,  the  power  stays 
nearly  unchanged.  This  illustrates  the  performance  deterioration  of  the 
sample  mean  detector  when  the  underlying  noise  is  skewed.  A  density 
function,  Eq.  (27),  with  a  skewness  measure  £  =  0.6  is  shown  in  Fig.  2.D 
together  with  the  corresponding  Gaussian  density  function.  For  the  ini¬ 
tial  false-alarm  rate  a  =  10-s,  this  noise  density  function  corresponds  to 
a  100%  increase  in  false-alarm  rate.  Although  these  two  density  functions 
are  hardly  distinguishable,  from  Fig.  2.D,  the  difference  between  their 
associated  false-alarm  rates  is  rather  significant.  Fig.  2.E  plots  the 
Neyman-Pearson  optimal  statistics  associated  with  these  two  noise 
models  given  in  Fig.  2.D. 

In  Eq.  (27),  by  use  of  the  L’Hospital’s  rule,  it  can  be  shown  that 


Kin/y0(v)  =p(y) 
(■*0  " 

where 


9(v)  = 


V2n  Bz 


■exp[— 5-(£)2]  for 


and  thus 
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Fig.  2.C  Rate  of  Change  In  False-alarm  Rate,  Power, 
and  Probability  Of  Error 
Sample  Size  M  -  100 
a0  =  1CT10 


Fig.  2.D  Probability  Density  Functions  of  Eq.  (27) 

- Gaussian  (  (  -  0  ) 

- Skewed  (  £  =  0.6 ) 


Um  o  =  $( — and  lim  0  =  *( — 

(•*0  a  C-*o  a 

which  can  also  be  derived  easily  from  Eqs.  (31)  and  (32)  by  L'Hospital’s 


Furthermore,  it  is  interesting  to  note  that  the  false-alarm  rate  and 
the  power  are  bounded.  By  observing  that 

♦(x)-#(y)  <  $(x -y)  <  $(x+y)  <  $(x)  +  #(y),  y  0 


one  can  show  that 


$(x+y)  <$(x~y),  y  <  0 


Tq+B.  Tq—B . 


H-SZ -sgt^.'.H-SZsekS) 

In  fact,  if  7*0  >  s  -A  -  s  -  the  power  /S  is  upper  bounded  by  0.5.  Note, 

D  M 

however,  when  the  sample  size  M  is  large  and  the  skewness  measure  £  is 
small,  the  case  Tq  >  s- A  is  unlikely  to  happen. 


2.0  Conclusions 


Two  applications  of  series  expansions  for  detection  problems  are  dis¬ 
cussed  in  this  chapter,  the  Edgeworth  series  and  the  Cornish-Fisher 
inverse  expansion.  For  the  former,  an  approximation  to  the  locally 
optimum  test  statistic  is  evaluated  by  using  the  Edgeworth  expansion  to 
represent  the  noise  density  function.  This  approximation  is  not 
guaranteed  to  converge  to  the  locally  optimum  statistic,  unless  the  Tau- 
berian  condition  is  satisfied.  Nevertheless,  it  is  applicable  to  a  certain 
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family  of  noise  distributions  since  only  the  first  few  moments  are 
required  to  obtain  this  approximation.  Moreover,  the  scheme  thus  dev¬ 
ised  may  still  yield  good  detector  performance.  More  sophisticated 
series  expansions  may  be  introduced  to  achieve  better  solutions.  In  the 
latter  part  of  this  chapter,  an  application  of  the  Cornish-Fisher  expansion 
is  demonstrated  in  the  study  of  the  effect  of  noise  skewness  on  the  sam¬ 
ple  mean  detector.  The  sample  mean  detector  is  shown  to  be  sensitive  to 
the  noise  skewness. 
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Appendix  2.A 

Derivation  of  Equation  (23) 

The  notations  used  in  the  following  discussion  are  defined  in  Section 
4.  Suppose  that  the  observation  sequence  is  a  sequence  of  i.i.d.  random 
variables,  and  that  the  sample  size  M  is  sufficiently  large  so  that  the  Cen¬ 
tral  Limit  Theorem  can  be  applied  here.  Now,  define 

T(x)  4  £  T(ii) 

i*  1 

Then  the  distribution  of  T(x),  by  the  Central  Limit  Theorem,  is  approxi¬ 
mately  a  normal  distribution  with  zero  mean  and  variance  Mof,  where 
of  =  Var  [TiX,)]  . 

The  threshold,  T0,  and  the  power,  p,  of  this  detection  scheme  can 
thus  be  determined  with  a  prescribed  false-alarm  rate  a  as  follows  : 


a  »  -7==j - f  exp[-**/2J/<'^  dx  -  l  -  *[  -7=? — ] 

V27T Mot  JTa  vA/ffr 

Hence  the  threshold  for  the  given  false-alarm  rate  a  is  equal  to 


Tq  =  Of  (1-a) 

Also,  the  power  p  is 

fi  =  *[  'SM  -2 — $-1(l-a)] 

Of 

On  the  other  hand,  the  power  Pi  of  the  linear  detector  using  the  same 
threshold  and  false-alarm  rate  is  given  by 


Pi  =  $[  V77s  -  $_1(l-a)] 

Thus  the  ARE  of  the  nonlinear  scheme  compared  to  the  linear  detector  is 


AREuod.id  - 


lim 

,N-< 


M__  JL 
N  of 


with  s  -*  0,  and  p  -  Pi 


K 


— ,  '  *  'O  ’ 

Z5  :  5k4  -  3k6  =  0 

Zg  :  Kg  =  0 

Now,  consider,  for  example,  the  following  noise  model : 

/(x)  =  (l-e)/j(x)  +  e/2(x) 

where 


Then  clearly, 


/ 1(») 
/2(*) 


1 


g  -x*/2 

— V5|*  | 


and 


*4  =  3e 


k6  =  30c 


k 4  =  with  0  s *4  *  3  .  (since  0  =s  as  1  )  . 

The  lines  Z0  to  Z6  and  their  corresponding  signsd-half  spaces  are  shown 
on  Fig.  2.F.  From  analytic  geometry,  one  can  see  that 

f  a0,  a\,  °2,  °3,  a6,  a7  j  ^  0 

and 


(  a4i  o5  }  s  0 

According  to  Descartes’  law  of  signs  [16],  we  conclude  that  both  the 
denominator  and  numerator  will  have  either  no  positive  real  roots  (p.r.r.) 
or  two  p.r.r.’s.  If  either  the  numerator  or  the  denominator  has  two  p.r.r., 
then  (2x-/i(x))  <  0  holds  in  some  subinterval  of  (0,  In  this  case, 
h(x)  >  0,  thus  we  have 


2x/i(x)  <  h2(x) 

On  the  other  hand,  if  neither  the  denominator  nor  the  numerator  has 
p.r.r.,  then  (2x-/i(x))  >  0,  for  all  x  e  [0,»]  .  Nevertheless,  h(x)  may  be 
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negative  for  some  x,  so  (2x— h.(x))/i(x)  is  not  always  nonnegative  even  if 
ZxK{x)  <  hz(x). 

The  above  discussion  actually  illustrates  one  way  of  investigating  the 
performance  of  the  approximate  locally  optimum  detector  using  the 
knowledge  on  the  moments.  Further  study  would  be  required  if  a  neces¬ 
sary  and  sufficient  condition  for  ARE^fli  M  >  1  is  desired. 


CHAPTER  3 


SIGNAL  DETECTION  IN  NEARLY  GAUSSIAN  SKEWED  NOISE 

3.1  Introduction 

Much,  if  not  most,  of  the  large  literature  on  signal  detection  has  been 
based  on  the  assumption  that  the  underlying  noise  is  symmetrically  dis¬ 
tributed  with  respect  to  the  mean.  Typical  examples  include  the  sample 
mean  detector  and  most  robust  detectors  [1]  -  [3].  This  assumption, 
although  generally  making  problems  more  tractable  and  providing 
simpler  solutions,  may  not  be  exactly  true  in  practice.  In  such  applica¬ 
tions  as  sonar,  asymmetrically  distributed  noise  which  might  result,  for 
example,  from  underwater  reverberation  or  sea  clutter,  has  been  encoun¬ 
tered  [4].  Furthermore,  data  from  a  number  of  natural  environments, 
e.g.  under-ice  ambient  noise  [5],  have  actually  shown  asymmetrical  sta¬ 
tistical  properties.  In  such  situations,  optimal  detectors  could  still  be 
found,  but  there  are  some  additional  difficulties  in  the  implementation  of 
such  detection  schemes.  The  test  statistics  are  usually  complicated  or 
intractable  and  to  implement  them  precisely  requires,  of  course,  a  com¬ 
plete  knowledge  of  the  underlying  noise  density  functions.  Furthermore, 
the  performance  of  these  detection  schemes  may  be  sensitive  to  the 
inaccuracy  of  the  presumed  statistics,  and  in  particular,  to  the  exact 
amount  of  skewness. 

In  a  recent  paper,  Kassam  et.  al.  [6]  discussed  a  robust  detection 
problem  using  a  noise  model  which  allows  a  symmetric  contaminated- 
nominal  central  part  and  an  arbitrary  tail  behavior.  Except  for  this 


example,  the  problem  of  detection  with  skewed  noise  does  not  seem  to 
have  been  discussed  extensively  in  the  signal  detection  literature.  How¬ 
ever,  in  the  context  of  other  statistical  applications,  there  has  been  some 
work  on  investigating  the  effect  of  population  skewness  on  hypothesis 
testings  or  estimation  problems.  Johnson  [7]  discussed  the  effect  of 
population  skewness  on  the  t-variable  and  proposed  a  procedure  to 
modify  the  t-variable  so  that  this  effect  can  be  reduced.  Carrol  [8]  exam¬ 
ined  the  effects  of  asymmetry  on  estimates  of  variance  of  robust  esti¬ 
mates  in  location  and  regression  problems  and  showed  that  heavy  skew¬ 
ness  of  errors  can  seriously  bias  the  commonly  used  estimates  for  loca¬ 
tion  and  intercept.  The  purpose  of  this  chapter  is  to  study  the  effect  of 
noise  skewness  on  the  performance  of  the  sample  mean  detector  and  the 
sign  detector  and,  then,  to  examine  a  modification  of  the  sample  mean 
detector  which  is  less  affected  by  the  noise  skewness.  Evaluation  of  the 
detectors  will  be  based  on  the  false-alarm  rate,  the  power,  and  the  proba¬ 
bility  of  error. 

There  are  many  ways  to  define  the  skewness  measure.  Throughout 
this  chapter,  this  measure  is  taken  to  be  the  third-order  oentral  moment 
divided  by  the  cube  of  the  standard  deviation.  This  definition  is  one  of 
the  most  commonly  used.  By  introducing  an  asymptotic  expansion  for 
the  test  statistics,  the  detector  performance  can  be  related  explicitly  to 
this  measure.  In  order  to  facilitate  analysis,  the  test  statistics  will  be 
normalized  by  the  sample  size  M.  This  normalization  procedure  will  not 
affect  the  performance  of  the  detectors  so  long  as  the  thresholds  are 
adjusted  correspondingly.  Then  the  relation  between  the  performance 
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and  the  noise  skewness  is  obtained.  This  relation  will  give  a  clear  qualita¬ 
tive  insight  into  detector  performance,  although,  for  any  finite  M,  it  will 
be  an  approximation.  It  will  turn  out  that  the  sample  mean  detector  is 
more  sensitive  to  noise  skewness  than  the  sign  detector,  when  the  under¬ 
lying  noise  is  only  sightly  non-Gaussian  due  to  skewness.  This  result  is 
reasonable  since  the  sign  detector  only  assumes  a  zero  median,  a  milder 
assumption  than  symmetry,  for  the  noise  distribution.  The  second  part 
of  this  chapter  then  proposes  a  modified  scheme  for  the  sample  mean 
detector,  which,  by  the  introduction  of  a  correction  term,  reduces  the 
effect  of  noise  skewness. 

Curves  which  describe  the  performance  change  with  respect  to  noise 
skewness  measure  for  both  the  sample  mean  detector  and  the  sign  detec¬ 
tor  are  given  in  Figs.  3.A  -  3.C.  Specifically,  Fig.  3.A  shows  rates  of  change 
in  the  false-alarm  rate  for  both  detectors,  Fig.  3.B  depicts  those  in  the 
power,  and  Fig.  3.C  is  for  the  probability  of  error.  Note  that,  in  these 
figures,  the  thresholds  are  kept  constant  as  the  skewness  measure  varies. 
Fig.  3.D  illustrates  the  structure  of  the  modified  sample  mean  detector. 
In  the  last  part  of  this  chapter,  some  numerical  examples  based  on 
Monte-Carlo  simulations  and  on  some  data  from  under-ice  ambient  noise 
are  given. 

3.2  Preliminary  Results 

One  of  the  major  difficulties  in  examining  conventional  detectors  with 
asymmetrically  distributed  noise  has  been  that  the  skewness  measure 
does  not  usually  appear  explicitly  in  the  test  statistics  and  thus  perfor- 


mance  of  the  detector  can  hardly  be  expressed  in  terms  of  this  measure. 
Although  several  different  skewness  measures  are  available  in  the  litera¬ 
ture  [9], [10],  this  difficulty  remains  unchanged.  The  reason  appears  to 
be  that,  in  general,  skewness  measures  are  evaluated  as  a  function  of  the 
third-order  moment,  a  function  of  the  mean  and  the  mode  or  the  median, 
or  even  as  a  function  of  the  kurtosis  and  the  third-order  moment.  On  the 
other  hand,  the  test  statistics  usually  evolve  from  the  probability  density 
functions  (p.d.f.)  or  from  other  functions  of  the  distribution  which  may 
not  be  related  explicitly  to  those  skewness  measures  mentioned  above. 
Hence,  even  though  it  is  intuitively  clear  that  the  statistical  properties  of 
the  test  statistics  are  affected  by  the  noise  skewness,  the  explicit  func¬ 
tional  relations  are  hard  to  acquire.  One  reasonable  way  to  get  around 
this  problem  is  to  obtain  an  asymptotic  expansion  for  the  test  statistic  in 
terms  of  the  moments  which  are  more  clearly  related  to  the  usual  skew¬ 
ness  measures. 

The  main  purpose  of  this  section  is  to  develop  some  preliminary 
results  on  which  the  procedure  of  expanding  the  test  statistics  will  be 
based.  There  are  several  general  ways  to  obtain  an  asymptotic  expansion 
for  a  random  variable.  The  one  to  which  attention  here  will  be  restricted 
is  the  use  of  the  Cornish-Fisher  expansion  [11].  A  general  form  of  such  an 
expansion  for  a  random  variable  X  is  given  by 

CF(X)  =  Mo  +  oZ  +  ^Z*  -  1)  +  •  •  •  (1) 

where  ^=E[X},  o2=E|(A,-/i0)2J.  ^-E\{X-^%  and  Z  is  a  standard  nor¬ 
mal  random  variable.  In  this  chapter,  our  interest  is  to  employ  this  sort 
of  expansion  for  the  sample  mean  of  a  sequence  of  independent  and 
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identically  distributed  (i.i.d.)  random  variables  so  that  some  asymptotic 
properties  can  be  achieved.  To  begin  with,  the  asymptotic  expansion  for 
the  sample  mean  should  be  defined.  Let  (A*  {  be  a  sequence  of  i.i.d.  ran- 

dom  variables  with  sample  mean  Sy= Xi,  where  M  denotes  the  sam- 

"<■  1 


pie  size.  We  will  call  a  series  of  random  variables,  o0+ £ 

i»J 

"asymptotic  expansion  valid  to  r  terms"  for  Sy  if 


°i 

Mi/Z 


Zit  an 


\Su-a0-  t  -jjfe-Zi  \  =  o  (M~r/Z)  w.p.  1.  (2) 

This  definition  evolves  from  Wallace  [12]  who  defined  an  asymptotic 
expansion  valid  to  r  terms  for  a  distribution  function  while  requiring  the 
remainder  to  be  0(Af“r/2).  However,  it  will  be  seen  later  that  requiring 
the  remainder  to  be  o(M~r/z)  as  in  Eq.(2)  is  more  convenient  and  furth¬ 
ermore,  is  in  accordance  with  Erdelyi  [13].  It  is  clear  that,  when  r  =1,  if 
a0  is  taken  to  be  the  mean  of  Xi  and  Z\  is  the  standard  normal  random 
variable,  then  Eq.(2)  is  compatible  with  the  ordinary  Central  Limit 
Theorem.  Furthermore,  one  can  argue  that  the  Central  Limit  Theorem 
suggests  that,  under  some  general  conditions,  the  distribution  of  Sy 
tends  to  be  symmetric.  However,  it  will  be  shown  here  that,  by  introduc¬ 
ing  the  series  expansion  for  the  test  statistics,  one  can  obtain  a  solution 
with  faster  convergence  rate.  Now,  let  o2  and  be  the  variance  and  the 
third-order  central  moment  of  Xi  respectively,  and  let  the  skewness 
measure  {  be  defined  by 


Then,  it  is  easy  to  see  that  the  skewness  measure  $y  of  the  distribution  of 
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Sy  is  given  by 

tu  =  0  as  M  -*oo 

V77  a3 

These  arguments  lead  to  the  following  lemma: 

Lemma:  Let  {A*}  be  a  sequence  of  i.i.d.  random  variables  with  common 
distribution  F.  Assume  that  all  moments  of  A*  exist  and  that  the  skew¬ 
ness  measure  of  F  is  defined  by  Eq.(3).  Then  the  skewness  measure  of 
the  distribution  of  Sy  satisfies  the  following  relation 

lu  =  0(1 V-*'*)- 

However,  the  interest  here  is  to  consider  an  asymptotic  expansion 
valid  to  higher  order  terms  such  that  the  expansion  will  contain  the  skew¬ 
ness  measure.  In  fact,  these  higher  order  terms  are  important  since  tak¬ 
ing  one  or  two  more  terms  usually  improves  the  approximation 
'  significantly  and,  typically,  may  correct  the  skewness.  Further  investiga¬ 
tion  reveals  that  Eq.(2)  determines  an  equivalence  relation  between 
sequences  of  i.i.d.  random  variables.  We  can  say  that  two  sequences  of 
i.i.d.  random  variables  are  asymptotically  equivalent  if  their  asymptotic 
expansions  valid  to  r  terms,  for  the  sample  mean,  differ  by  o  (M~r/Z)  for 
each  r .  Thus  an  equivalence  class  based  on  this  relation  may  be  defined. 
And  therefore,  a  valid  asymptotic  expansion  defined  in  Eq.(2)-  may 
represent  a  class  of  sequences  of  i.i.d.  random  variables. 

We  now  proceed  to  present  a  Cornisb-Fisher  type  expansion  for  the 
sample  mean  of  a  sequence  of  i.i.d.  random  variables  (Aij.  The  following 
proposition  is  a  straightforward  result : 
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Proposition  1:  Let  }AiJ  be  a  sequence  o!  i.i.d.  random  variables  and  Sy 
be  its  siz e-M  sample  mean.  Assume  that  all  moments  of  A*  exist  and  that 
Ms 

I  1  «  1;  then  an  expansion  for  Sy  given  by 
cr 

VAE^ = * +  w* + (4) 

is  an  asymptotic  expansion  valid  to  two  terms  for  Sy,  where 
HQ-E[Xi\,  o2=El(Xi-no)2l,  it3=El(Xi-Mo)9{>  and  Z  is  a  standard  normal 
random  variable. 

If  the  skewness  measure  is  defined  by  Eq.  (3),  then  Eq.  (4)  obviously 
contains  this  measure.  Now,  by  the  use  of  some  weak  convergence 
theorems  on  pages  287  and  288  in  [14]  and  the  observation  that  VAE(Sy) 
is  a  continuous  function  of  Z,  ProposiMonZ  follows  immediately  from  Pro¬ 
position  l . 


Proposition  2:  Let  Yy  =  Mo  + 


M3 

Sc^Af 


{Zz- 1). 


Let  FYm  and  FSm  be 


the  distribution  functions  of  Yy  and  Sy  respectively.  Then,  under  the 
assumptions  in  Proposition  1,  Yy  converges  in  law  to  Sy  ;  namely, 


J55/V*  =  FSm 

at  each  continuity  point  of  Fs„- 

PropositionZ  enables  us  to  evaluate  the  asymptotic  statistics  of  Sy 
via  those  of  Yy.  As  a  matter  of  fact,  the  p.d.f.  of  Yy  exists  and  can  be 
evaluated  as  follows: 

Substituting  Eq.  (3)  into  Eq.  (4)  yields 

** +  &21-" 


Yy  =  VAE(Sy)  =  ^  + 


(5) 
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Then,  it  can  be  shown  that 

(i)If  f  0,  the  p.d.f  of  Yu  is  given  by 


(ii)  If  f<0.  the  p.d.f.  of  Yu  is  given  by 


1=0  otherwise 


The  preceding  discussions  have  been  focused  on  continuous  random 
variables.  It  should  be  noted  that  these  expansions  may  not  be  valid  for 
some  discrete  random  variables.  However,  in  the  context  of  this  chapter, 
only  binomially  distributed  random  variables  will  be  discussed  and  these 
will  be  seen  to  have  valid  asymptotic  expansions.  Thus  the  discussion  for 
the  discrete  case  will  be  omitted  here. 


3.3  The  Sample  Mean  Detector 

The  sample  mean  detector  is  known  to  be  the  Neyman-Pearson 
optimal  detector  when  the  underlying  noise  density  is  Gaussian.  This  sec¬ 
tion  considers  sample  mean  detector  performance  deterioration  under 
model  deviation  due  to  skewness.  The  asymptotic  expansion  introduced 
in  the  previous  section  will  be  employed  here  for  the  test  statistic,  and 


relations  between  the  performance  and  the  noise  skewness  measure  will 
then  be  obtained.  It  will  be  shown  that  the  performance  of  the  sample 
mean  detector  does  deteriorate,  as  expected. 

The  following  binary  hypothesis  testing  problem  is  considered: 

Ho.Xi-Ni 

K0  :  Xi=Ni+s  i=1’2 . M 

It  is  assumed  here  that  (Ail  is  a  sequence  of  i.i.d.  random  variables. 
Under  H0,  the  p.d.f.  of  X±  is  /  (xj,  which  is  nearly  Gaussian,  and  has  zero 
mean,  known  variance  a2  and  known  third-order  moment  /ig,  where  /ug  is 
small.  Now,  the  test  statistic  of  the  sample  mean  detector  is  given  by 

T(X)  =  \rtXi 

Miml 

As  discussed  in  Section  3.2,  there  exists  an  asymptotic  expansion  valid  to 
two  terms  for  T(X).  For  notational  convenience,  the  subscript  M  will  be 
omitted  in  the  sequel  and  the  asymptotic  expansion  for  T(X)  will  be 
denoted  by  Y.  Thus,  in  accordance  with  Eq.  (5), 

under  W0  (8) 

undergo:  Y=  YK  =  *  +  (®) 

Let  Ffj(y)  and  Fy(y)  denote  the  distribution  functions  of  Yh  and  Ir¬ 
respectively.  Now,  define  ay  and  &y  by 

au  -  f  dFfiiy)  (10) 

"b 

and 

Pu  -  f_  dF/civ)  (11) 

To 

where  f0  is  the  given  threshold.  Then,  by  Propositions  1  and  2,  it  can  be 
concluded  that  ay  and  fiy  converge  respectively  to  the  false-alarm  rate 


at  and  the  power  ft  of  the  sample  mean  detector.  Now, 


! 


i 


(i)  if  £  ^  0,  substituting  Eq.  (6)  with  Mo  =  0  and  m©  =  s  into  Eqs.  (10) 
and  (11)  respectively  yields 


au  -  ♦(-- M*(-  ^(r-°-)-+1-)  +  »(-L)»(-go(7,<>)  l) 
ft  ft  ft  ft 


(12) 


and 


f>n = ♦(-^-)»(-g,(^o)'t’1)  +  ) 

ft  ft  ft  ft 


(13) 


where 


Qo(T0)  -  Vl+atz+2f7,0/3ty  and  $,(7o)  =  Vl+ftz+2f(r0-s)/3o 
When  Af  is  sufficiently  large  and  £  is  small,  a#  and  Py  can  be 
approximated  by 

g0(r0)-i 


(Xy  w  $(— ■ 


ft 


(14) 


and 


P y  w  $(— 


Qs(T0)-l 


ft 


(15) 


(ii)  if  £  <  0,  substituting  Eq.  (7)  with  Mo  =  0  and  Mo  =  s  into  Eqs.  (10) 
and  (11)  respectively  yields 


■»«  =  »(-TL)[»(--°-y--‘--)  ~  *(;-)]  + 
ft  ft  ft 

♦(-L)[*(-£2ll2>±i}_*(^L)]  (l2a) 

ft  ft  ft 

and 

fin  =  »(^w--(v~-)  -  ♦(-*-)]  + 

ft  ft  ft 

(13a) 

ft  ft  ft 

Similarly,  when  M  is  sufficiently  large  and  £  is  small,  ay  and  Py  can 
be  approximated  by  Eqs.  (14)  and  (15)  respectively.  Since  ay  and  Py 
converge  to  at  and  Pi  respectively,  the  qualitative  behavior  of  a(  and  fa 
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can  be  studied  approximately  by  ay  and  fiy  respectively,  provided  that 
M  is  sufficiently  large.  Now  the  probability  of  error  Ptl  is  given  by 


P,i  "  -■)  +  $(-gq(ro)  l-)  (16) 

(Tt 

It  can  be  seen  from  Eqs.  (14)-(  16)  that  the  dependence  of  the  detector 
performance  on  the  noise  skewness  measure  comes  from  the  variables 
Qo(T0),  Qs(To),  and  at.  Figs.  3.A  -  3.C  provide  clearer  quantitative 
insights  as  to  sensitivity  of  the  sample  mean  detector  to  the  underlying 
noise  skewness.  One  can  see  that  both  the  false-alarm  rate  and  the  pro¬ 
bability  of  error  increase  as  the  skewness  measure  increases,  while  the 
power  stays  nearly  the  same,  an  indication  of  the  performance  deteriora¬ 
tion  of  the  sample  mean  detector  when  the  noise  skewness  comes  into 


Before  concluding  this  section,  it  may  be  worth  mentioning  that, 
throughout  the  above  discussion  ,  we  have  assumed  implicitly  that  the 
threshold  T0  lies  inside  the  region  where  the  corresponding  p  d.f.  for  Yy 
is  nonxero.  The  case  when  Tq  lies  outside  that  region  will  result  in  a 
singular  detection  problem  which  is  of  little  interest  here. 


3.4  The  Sign  Detector 

It  is  well  known  that  the  sign  detector  is  the  Neyman-Pearson  optimal 
detector  for  the  nonparametric  test  (HiKx) 

Hx  :p  -  1/2 

Ki  .p  >  1/2 

where  p  =  Pro b.  |  ^  a  0  j.  Also,  it  is  the  locally  optimal  detector  when 
the  underlying  noise  has  a  Laplace  distribution  [15], [16].  The 
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skewness  measure  £ 

Fig.  3. A  Rate  of  Change  in  False-alarm  Rate 

a0  is  the  false-alarm  rate  when  the  underlying  noise  is  Gaussian 
a0  =  l.OxlO-10 

The  threshold  is  set  by  a0  and  is  kept  fixed  when  the 


noise  skewness  measure  f  varies. 
Sample  size  =  400 


Fig.  3.B  Rate  of  Change  in  Power 


Pq  is  the  power  when  the  noise  is  Gaussian 

~  0.99999  for  the  linear  detector 
K  0.99998  for  the  sign  detector 

Signal  Strength  =  2  x  ( threshold  of  the  linear  detector  ) 

Sample  size  =  400 
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skewness  measure  f 


Fig.  3.C  Rate  of  Change  in  Probability  of  Error 


P,0  is  the  probability  of  error  when  the  noise  is  Gaussian 


r.o 


2.0xlO-10  for  the  linear  detector 

2.0*10"*  for  the  sign  detector 


Sample  size  =  400 


performance  of  the  sign  detector  when  the  underlying  noise  is  skewed  will 
now  be  studied.  It  is  easy  to  see  that,  when  the  underlying  noise  is 
skewed,  the  binary  test  (if j./fj)  really  becomes 

Hz:p  =  1/2  +  e(£) 

Kz:p>  1/2  +  c(S) 

where  e(()  denotes  the  deviation  depending  on  the  skewness  measure  £. 
In  order  to  be  compatible  with  the  previous  discussion,  the  following  nor¬ 
malized  test  statistic  is  considered: 

T(X)  =  (17) 

M  imi 

where 

fl  if  A>0 

s9ri  (Xi )  -  |q  otherwise 

Again,  an  asymptotic  expansion  for  T(X)  will  be  needed  in  order  to  facili¬ 
tate  analysis.  In  a  survey  paper,  Bickel  [17]  discussed  the  Edgeworth 
expansion  in  nonparametric  statistics  with  emphasis  on  higher-order 
approximations  to  the  distribution  of  those  statistics.  However,  the  test 
statistic  of  the  sign  detector  does  not  seem  to  have  been  discussed.  In 
this  section,  the  previously  considered  Comish-Fisher  type  asymptotic 
expansion  will  be  utilized  again  for  T(X).  The  first  three  central  moments 
of  T(X)  under  Hz  and  Kz  are  given  in  the  following: 

under  Hz  :  a*o=1/2  +  e($) 

o2  =  1/4  -  e8(f)  (18) 

and  /X3  =  -ZciOo2 
under  Kz  :  Pc  =  p  -  1/  2  +  c,  tf) 

o2  =p  -pz  =  1/4  -  c,*(£)  (19) 

and  V^-P  -  3p2  +  2ps  =  -2cJ($)o2 
where,  for  convenience,  p  is  taken  to  be  1/2 +e,(£)  under  Kz,  and  e,(£)  is 
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a  variable  depending  on  both  the  signal  strength  and  the  skewness  meas¬ 
ure.  Now,  following  the  discussion  in  the  previous  sections,  there  exist 
asymptotic  expansions  valid  to  two  terms  for  T{X)  under  both 
hypotheses,  namely, 


under  H2  :  YHl  =  1/2  +  t({)  +  ^/3ZI52S^  _  ^-{Z2  -  1) 

under  K2  >*,=  1/2+  e,(()  +  V —  -  1 ) 

Then,  by  going  through  the  same  procedures  as  of  Section  3.3,  we  can 
obtain  the  false-alarm  rate  a,  and  the  power  /Ss  for  the  sign  detector. 


a,  «  $( 


1-Wo) 


where 


3VAf(l/4-c2(f)) 


a,  - 


-2g,(0 

3>/Af(l/4-e*(e)) 


_  r  .  _2.  4e(«(r0+e(«-l/2) 

«(..)-  yi>M  3(1/4-c»(f)) 

rr(T0)  =  -u/ i+^»+^MIS^pEIZlI  • 

V  3(1/4-c2({» 

Thus  the  probability  of  error  P„  is 

P.,  »»(j?l(^o)~1)-»(1~^(7’o))  (22) 

u  f  O, 

Eqs.  (20)-(22)  reveal  how  the  performance  of  the  sign  detector  changes  as 
the  underlying  noise  skewness  changes.  The  dependence  on  the  skewness 
is  embedded  in  the  variables  o,,  o',,  R(T0)  and  R'{T0).  Figs.  3.A  -  3.C  also 
depict  the  quantitative  relation  between  the  performance  of  the  sign 
detector  and  the  noise  skewness.  The  noise  model  used  here  is  the  same 
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as  that  for  the  sample  mean  detector.  The  noise  p.d.f.  is  given  in  Eq.  (6) 
with  hq  -  0.  For  this  particular  model,  which  is  nearly  Gaussian,  the  sign 
detector  appears  to  be  less  sensitive  to  the  noise  skewness.  This  result 
is  reasonable  since  the  sign  detector  only  assumes  a  zero  median  of  the 
noise  distribution,  which  is  a  milder  assumption  than  zero  skewness.  On 
the  other  hand,  the  sample  mean  detector  assumes  the  Neyman-Pearson 
optimality  for  a  Gaussian  noise  model,  which  has  zero  skewness  measure. 

3.5  A  Modified  Sample  Mean  Detector 

In  Section  3.3,  the  sample  mean  detector  was  examined  with  skewed 
noise.  The  noise  model  used  there  is  a  Gaussian  model  perturbed  with  a 
small  amount  of  skewness.  It  has  been  shown  that  the  performance  of 
the  sample  mean  detector  deteriorates  when  the  underlying  noise 
becomes  skewed,  although  the  significance  of  this  performance  deteriora¬ 
tion  depends  on  the  particular  application.  It  is  then  interesting  to  con¬ 
sider  reducing  this  effect  by  modifications  of  the  detector  structure.  In 
this  section,  a  modified  scheme  is  proposed  based  on  the  asymptotic 
expansion  for  the  sample  mean  of  an  i.i.d.  random  sequence  discussed  in 
Section  3.2.  A  mean-squared  term  is  added  to  the  sample  mean  so  that, 
in  the  asymptotic  expansion,  the  terms  involving  the  lowest  power  of  the 
skewness  measure  can  be  eliminated.  Now,  with  the  assumption  that  the 
skewness  measure  is  small  and  that  the  sample  size  M  is  sufficiently 
large,  one  can  expect  that  the  effect  of  the  noise  skewness  is  essentially 
eliminated.  Again,  the  following  hypothesis  testing  problem  is  con¬ 
sidered: 
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Kq:  Xi  -  Ni  +  s  l"1,2 . M 

As  in  Section  3.3,  it  is  assumed  here  that,  under  Hq,  E(AiJ=0,  E(Ai2j=cr2 , 
and  E{XiZ\-fiQ,  where  M3  is  small.  Then,  consider  a  test  with  the  following 
test  statistic: 

rmvn  =  (23) 

where 

4  l i 

X  = 

M  (■! 

The  parameter  p  will  be  determined  as  follows: 

By  Proposition  1 ,  an  asymptotic  expansion  valid  to  two  terms  for  X 
exists,  namely 

T  =  -2=4.  +  -^-<Z2-1)  +  o  (AT1),  under  (24) 

vS  6<rM 

Substituting  Eq.  (24)  into  Eq.  (23)  yields 

W)  =  + pf<za-D  4  •<*-•) 

Thus,  if  p~- M3/604.  the  term  involving  Ms  "dll  be  eliminated  and  other 
terms  involving  higher  powers  of  the  skewness  measure  will  be  contained 
in  o(Af-1),  i.e. 

Tm(X)  =  ^4  +  o(AT*) 

Therefore,  when  the  skewness  measure,  f=Ms/  is  small  and  the  sample 
size  M  is  sufficiently  large,  Tm{ X)  can  be  weU  approximated  by  a  normal 
random  variable  which  is  independent  of  the  skewness  measure  of  the 
noise.  Hence  the  false-alarm  rate  am  and  the  power  pm  are  given  by 

am«l-  $(v!7  Tq/  o ) 


fin>*l-i(VM(T0-s)/a) 

where  To  is  the  prescribed  threshold.  Thus  it  can  be  seen  here  that  the 
performance  of  the  modified  sample  mean  detector  is  asymptotically 
independent  of  the  noise  skewness.  From  Eq.  (23)  and  the  above  discus* 
sion,  the  modified  sample  mean  detector  should  be  implemented  by  the 


following  test  statistic 


Tm(X)  =  X-  ^(X2) 


The  structure  of  this  modified  scheme  is  shown  in  Fig.  3.D.  In  prac¬ 
tice,  the  skewness  measure  £  may  not  be  known  exactly.  Then  a  learning 
procedure  or  adaptive  scheme  will  be  needed  to  obtain  an  estimate. 


3.6  Simulation  Examples 

To  investigate  the  modified  sample  mean  detector  proposed  in  the 
last  section  in  more  detail,  some  numerical  examples  based  on  Monte- 
Carlo  simulations  are  given  here.  For  simplicity,  the  noise  model  which 
has  a  probability  density  function  given  by  Eq.  (6)  is  used.  This  is  a  model 
resulting  from  a  Gaussian  distribution  perturbed  by  a  non-zero  skewness 
measure.  According  to  the  results  discussed  in  the  previous  sections,  the 
modified  scheme  is  expected  to  be  less  sensitive  to  noise  skewness,  com¬ 
pared  to  the  sample  mean  detector.  As  a  matter  of  fact,  it  should  be 
indifferent  asymptotically  to  noise  skewness.  The  following  examples 
verified  this  conclusion.  Fig.  3.E  shows  a  result  of  one  million  runs  with  a 
sample  size  of  100  and  the  a<j  ,  the  initial  false-alarm  rate  when  the  skew¬ 
ness  measure  £  is  zero,  set  equal  to  10"3.  Curves  are  plotted  for  the  rate 
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of  change  in  false-alarm  rate  as  the  skewness  measure  varies.  It  can  be 
seen  that  the  false-alarm  rate  of  the  modified  scheme  stays  nearly 
unchanged,  compared  to  that  of  the  sample  mean  detector,  as  £  changes. 
In  Fig.  3.F,  a  similar  set  of  curves  is  shown,  except  that  a0  =  10"4.  It 
should  be  noted  here  that  the  smoothness  of  the  curves  depends  on  the 
number  of  rims.  This  is  one  of  the  typical  properties  of  the  Monte-Carlo 
simulations.  However,  we  do  see  a  positive  verification  of  the  analytical 
results  obtained  previously. 

3.7  A  Case  Study  Using  Under-Ice  Ambient  Noise 

The  detection  scheme  discussed  in  Section  3.5  will  be  further  exam¬ 
ined  here  with  some  real  data  taken  from  under  ice  in  the  Arctic.  The 
data  were  preliminarily  analyzed  and  presented  in  [5].  The  mechanism 
for  the  noise  under  stationary  shore  ice  is  possibly  due  to  tensile  stresses 
caused  by  rapid  reduction  in  air  temperatures.  Noise  from  the  pack  ice, 
on  the  other  hand,  is  due  to  the  friction  between  interacting  and  colliding 
ice  floes  in  addition  to  tensile  s  tresses.  For  completeness  of  discussion, 
the  first  three  statistical  moments  of  the  data  are  shown  here  in  Figs.  3.G 
-  3.1  [5]  in  the  time  domain.  These  moments  were  estimated  in  1024  sam¬ 
ple  blocks,  which  were  identified  as  a  record  on  the  horizontal  axes  in 
these  figures.  The  samples  were  taken  at  a  sampling  rate  of  10 kHz,  thus 
each  record  represents  approximately  0.1  second  of  time.  It  is  seen  that 
the  data  has  a  non-zero  mean  which  is  due  to  the  carrier  frequency  of  the 
tape  recorder  being  slightly  misaligned.  Fig.  3.H  shows  a  large  variability 
of  the  variance  over  time.  This  illustrates  the  non-stationarity  of  the 
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skewness  measure  £ 

Fig.  3.F  Number  of  Runs  =  10* 

a0  =  1CT4 

Sample  Size  M  =  100 


noise.  Fig  3.1  depicts  the  skewness  measure  which  is  defined  in  Eq.  (3). 
It  can  be  seen  that,  for  some  records,  the  noise  exhibits  a  pronounced 
asymmetrical  property. 

The  purpose  of  this  study  is  to  investigate  two  detectors  (the  sample 
mean  detector  and  the  modified  one)  with  data  from  some  natural 
environment.  There  are  some  disadvantages  associated  with  the  use  of 
these  data,  however.  As  has  been  seen  from  Fig.  3.H,  the  data  represent 
a  nonstationary  process  and,  furthermore,  they  may  well  represent  a 
dependent  process.  These  two  characteristics  seriously  violate  the 
assumptions  discussed  in  this  chapter.  Hence  the  results  thus  obtained 
should  not  be  used  to  justify  the  analytical  results  of  the  previous  sec¬ 
tions.  In  an  attempt  to  eliminate  the  dependency  structure  in  the  noise, 
the  data  sequence  was  re-arranged  such  that  every  other  eight  consecu¬ 
tive  samples  is  used.  The  estimates  of  the  statistical  moments  are  then 
evaluated  based  on  the  re-arranged  data  sequence.  Thus,  each  record 
now  only  consists  of  120  sample  blocks.  Figs.  3.J  and  3.K  plot  the  false- 
aiarm  rate  and  the  probability  of  error  versus  the  threshold  for  the  two 
detectors  using  this  re-arranged  data  sequence  and  for  the  sample  mean 
detector  in  Gaussian  noise.  It  is  shown  here,  based  on  these  data,  that 
the  discrimination  between  these  two  detectors  does  not  appear  to  be 
significant.  It  should  be  noted  here  that  the  estimates  of  the  moments 
obtained  from  this  data  sequence  does  not  seem  to  be  different  from 
those  shown  in  Figs.  3.G  -  3.1. 
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Fig.  3.G  Statistical  Moment  of  Under-Ice  Noise  (  Mean ) 


«.«•  m.m  (hi.w  ian.w  wi.ih  Mi.w 

RECORD  NO. 


time 


Fig.  3.H  Statistical  Moment  of  Under-Ice  Noise  ( Variance. ) 
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Fig.  3.J  False-Alarm  Kate  vs.  Threshold 
( A  Semi-Log  Plot ) 

++++  Sample  mean  detector  in  Gaussian  noise 

_  Sample  mean  detector  under  ice 

A  A  A  A  Modified  sample  mean  detector  under  ice 
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Fig.  3.K  Probability  of  Error  vs.  Threshold 
( A  Semi-Log  Plot ) 

++++  Sample  mean  detector  in  Gaussian  noise 

■ - Sample  mean  detector  under  ice 

A  A  a  A  Modified  sample  mean  detector  under  ice 
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3.0  Conclusions 

The  problem  of  detection  with  skewed  noise  is  discussed.  The  sensi¬ 
tivity  of  the  sample  mean  detector  and  the  sign  detector  to  the  noise 
skewness  is  examined.  When  the  underlying  noise  is  Gaussian,  perturbed 
with  a  small  amount  of  skewness,  the  sign  detector  is  shown  to  be  less 
sensitive  than  the  sample  mean  detector.  A  modified  sample  mean  detec¬ 
tor  is  then  proposed  whose  test  statistic  is  constructed  by  adding  a 
correction  term  to  that  of  the  sample  mean  detector.  This  correction 
term  eliminates  the  lowest  power  of  the  skewness  measure  in  the  asymp¬ 
totic  expansion  of  the  test  statistic.  Thus,  when  the  skewness  measure  is 
small,  this  test  statistic  is  virtually  unaffected  by  noise  skewness. 
Although  the  analysis  here  is  based  on  the  assumption  of  large  sample 
size  and  small  skewness  measure,  it  does  provide  some  general  qualita¬ 
tive  insight  as  to  the  skewness  effect  on  detector  performance.  The 
simulation  results  in  Section  3.6  provide  a  support  to  the  analysis. 
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CHAPTER  4 


DETECTION  OF  SURE  SIGNALS  IN  NOISE  OF  BOUNDED  MAGNITUDE 

4.1  Introduction 

In  the  past  few  decades,  a  great  amount  of  literature  has  been 
devoted  to  the  theory  of  detection  of  signals  in  a  noise  environment  [l]- 
[8].  To  solve  a  detection  problem,  one  usually  has  to  impose  some  statist¬ 
ical  assumptions  on  the  noise  and  then  employ  methods  of  statistical 
inference.  Needless  to  say,  these  statistical  assumptions  ought  to  be 
introduced  with  the  consideration  either  of  providing  analytical  tractabil- 
ity  to  the  problem  or  of  being  compatible  with  practical  applications. 
Unfortunately,  these  two  factors  are  often  contradictory  to  each  other. 
For  example,  in  many  practical  situations,  the  assumption  that  the  noise 
sequence  is  a  sequence  of  statistically  well-defined  random  variables  is 
not  well  justified.  Hence  the  assumptions  on  the  noise  statistics  will  be 
just  an  approximation  or  simply  a  consequence  of  the  desire  for  analyti¬ 
cal  tractability. 

In  most  of  the  literature,  the  noise  process  considered  usually  has  a 
probability  density  function  with  infinite  support  and  the  tail  probabili¬ 
ties  are  extremely  small,  e.g.  the  Gaussian  density  function  or  the 
Laplace  density  function.  Practically  speaking,  the  tail  area,  considered 
sufficiently  far  out,  is  almost  negligible.  In  fact,  due  to  some  environmen¬ 
tal  limitations,  the  noise  may  actually  be  bounded  in  magnitude.  Furth¬ 
ermore,  the  availability  of  a  well  justified  bound  on  noise  magnitude  is 
usually  more  realistic  than  that  of  the  well  justified  statistical  knowledge 
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of  the  noise.  Thus  the  assumption  of  bounded  noise  may  lead  to  more 
practical  applicability  of  the  solutions.  As  a  matter  of  fact,  bounded 
noise  assumption  has  been  employed  in  considering  state  estimation  [9]- 
[10]  and  system  identification  problems  [11]. 

This  chapter  considers  a  detection  problem  using  a  bounded  noise 
assumption  and  proposes  a  rather  different  solution  to  this  detection 
problem.  To  begin  with,  in  Section  4.2,  the  signal  is  assumed  to  be  a 
known  positive  constant  and  no  statistical  assumptions  on  the  noise  are 
employed.  Then  a  sequential  test  procedure  is  devised  based  only  on  the 
knowledge  of  the  noise  bound.  In  contrast  to  the  conventional  hypothesis 
testing  procedure,  this  procedure  yields  a  singular  solution.  Specifically, 
it  involves  no  possibility  of  making  wrong  decisions  provided  that  the  test 
procedure  terminates.  Now,  one  may  argue  that  the  presumed  bound  on 
the  noise  magnitude  may  not  be  precise;  thus  the  resulting  singular  solu¬ 
tion  will  not  be  realistic.  One  way  to  circumvent  this  problem  is  to  intro¬ 
duce  some  randomness  to  the  bound;  namely,  to  suppose  that  the  bound 
is  a  random  variable  with  a  known  bounded  distribution.  Then  the 
sequential  procedure  becomes  a  random  test  in  the  sense  that  the  thres¬ 
hold  and  the  associated  performance  are  functions  of  a  random  variable. 
In  Section  4.3,  examples  of  some  particular  distributions  of  the  noise 
sequence  for  a  given  realization  of  the  presumed  bound  are  used  to 
demonstrate  this  example.  Section  4.4  addresses  the  duality  of 
hypothesis  testing  and  estimation.  Section  4.5  then  discusses  the  prob¬ 
lem  of  detecting  an  unknown  signal  in  bounded  noise  where  the  detection 
problem  will  be  formulated  as  a  problem  of  estimating  the  signal. 


4.2  Problem  Formulation 


Suppose  that  we  consider  the  problem  of  deciding  whether  or  not  a 
constant  signal  is  present  by  observing  a  noisy  data  sequence  fzj  {.  This 
problem  can  always  be  formulated  as  the  following  canonical  hypothesis 
testing  problem: 

H  :  *i  =  n* 

1=1,2....  (1) 

K :  a*  =  n*  +  s 

The  essential  assumption  to  be  discussed  in  this  chapter  is  that  the  noise 
sequence  j  is  bounded  in  magnitude,  namely 

n*z  &  B 2  for  any  i ,  where  B>  0  (2) 

Now,  if  it  is  assumed  here  that  the  signal  is  a  known  positive  constant, 
then 

under  H:  n?*.Bz  e  - B<xi<B  (3) 

under  A":  nf^B2  o  s-2?£ZjSs+f?  (4) 

From  Fig.  4. A,  it  can  be  seen  that  **<5-2?  only  if  H  is  true  and  xi>B  only 

if  K  is  true.  Thus,  to  solve  the  detection  problem,  one  can  proceed  as  fol¬ 
lows: 

1.  Obtain  the  data  sequence  (x*{  sequentially. 

2.  For  any  i,  if  **>2?,  stop  and  decide  K.  On  the  other  hand,  if 
Xi  <s  -B,  stop  and  decide  H. 

3.  If  s  -B  <Xi  <B,  obtain  another  observation  Xj+1  and  repeat  step  2. 

It  is  clear  now  that  if  the  bound  B  is  known  exactly,  and  if  the  test  pro¬ 
cedure  terminates,  one  can  expect  no  possibility  of  making  wrong  deci¬ 
sions.  In  contrast  to  conventional  detection  problems,  this  is  a  singular 
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problem  in  the  sense  that  this  test  procedure  leads  to  a  solution  of  zero 
false-alarm  rate  and  unity  power.  This  result  is  inherent  from  the 
assumption  imposed  here,  namely,  that  the  noise  sequence  is  bounded  in 
magnitude. 

The  sequential  procedure  discussed  here  involves  some  similarities 
to  the  standard  sequential  probability  ratio  test  ( SPRT )  [12].’  In  the 
SPRT,  one  assumes  that  the  noise  sequence  is  a  sequence  of  random 
variables  with  a  known  probability  distribution.  Then  the  likelihood  ratio 
L  is  computed  and  is  usually  a  function  of  time.  Two  thresholds  Ty  and 
Tz,  with  0<7’i<1<T2.  are  chosen  based  on  the  desired  values  of  the  false- 
alarm  rate  and  the  power.  The  likelihood  ratio  L  is  then  compared  to  the 
thresholds  T j  and  Tz.  If  it  exceeds  Tz  at  one  of  the  comparison  instants, 
the  test  terminates  with  the  decision  for  K  (signal  present).  If  the  likeli¬ 
hood  ratio  decreases  below  Tlt  the  test  terminates  with  the  decision  for 
H  (signal  absent).  If  L  remains  between  T\  and  Tz,  another  observation 
is  made.  On  the  other  hand,  instead  of  computing  the  likelihood  ratio 
each  time  after  making  an  observation,  the  sequential  procedure  dis¬ 
cussed  in  this  chapter  simply  compares  the  observed  data  **  to  the  two 
thresholds  B  and  s  -B  to  make  decisions.  Thus  the  test  considered  here 
requires  less  information  about  the  noise  and  involves  less  computational 
complexity.  However,  another  important  criterion  for  evaluating  perfor¬ 
mance  of  a  sequential  procedure  is  the  number  of  samples,  or  the  aver¬ 
age  sample  number  ( ASN ),  required  for  termination.  To  consider  the 
ASN,  let  us  start  by  showing  that  the  sequential  procedure  described 
here  will  terminate  with  probabflity  one,  under  some  general  conditions. 
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Suppose  that  the  probability  that  the  observed  data  x*  lies  in  the 

interval  [s-B,B]  is  pit  and  that  0<pi<l  for  all  i.  Furthermore,  if  the 

observation  sequence  {x*}  is  an  independent  sequence,  then  at  the  fc-th 

observation,  the  probability  that  this  test  procedure  has  not  terminated 
k 

is  UPi-  However, 

»«i 

* 

0 ss  lim  n  Pi  *  =  0 

where 


P  mai  ~  Pi 

l*i  *k 


The  above  argument  leads  to  the  following  lemma. 

Lemma  Let  {  be  an  independent  sequence.  Let 

Pi  =  Prob.  fa e[s-B,B]l  Then,  if  0<p,<l  for  all  i,  the  sequential  test 
procedure  discussed  in  this  chapter  terminates  with  probability  1. 

Knowing  that  the  test  procedure  will  terminate  with  probability  1, 
one  can  then  consider  the  ASN.  Now,  if  it  is  assumed  that  the  observa¬ 
tion  sequence  is  independent  and  identically  distributed,  then,  according 
to  the  strong  law  of  large  numbers  [13],  one  may  evaluate  the  ASN  as  fol¬ 
lows: 


(a).  Suppose  that  the  noise  sequence  (A\}  is  uniformly  distributed 
between  [—B,B\,  then,  under  H: 


Prob.\{x.-B±x±s-B)\  = 

Thus  the  expected  value  of  the  sample  number  M,  or  the  ASN,  is 
given  by 

Eh(M)  =  —  -*  ASN 

s 


Similarly,  under  K : 


Ek(M)  =  22-  -»  ASN 

One  can  easily  see  that  the  two  ASW's  may  not  be  equivalent 
under  both  hypothesis  if  one  allows  some  asymmetry  assump¬ 
tion.  Furthermore,  it  should  be  noted  that,  if  B»s,  then  the 
probability  that  the  observation  will  lie  in  [-j B,s-B]  will  be  very 
small  and  thus  the  ASN  will  be  very  large.  On  the  other  hand,  if 
B  and  s  are  comparable,  one  can  expect  a  rapid  termination  of 
the  test. 

(b).  If  the  noise  sequence  f  {  has  a  truncated  Gaussian  distribution, 
i.e.  if  it  has  a  density  function  given  by 


/at(*0 


_ 1 _ 

V2rro2(l-2$(-fi/o)) 

0 


€xp-nV2c*  if  |n  |s  5 
otherwise 


Then  it  can  be  shown  similarly  that 


ASN  -  _ (1  —  2$( — B/ a  )) - 


under  either  H  or  K. 


4.3  Constant  Known  Signals  in  Noise  of  Random  Bounds 

In  the  previous  section,  the  problem  of  detecting  a  known  signal  in  a 
magnitude  bounded  noise  has  been  discussed.  A  sequential  test  pro¬ 
cedure  was  proposed  which  leads  to  a  singular  solution  However,  there 
are  some  obvious  problems  associated  with  this  result.  The  presumed 
bound  may  not  be  tight,  and  thus  the  number  of  samples  required  to  ter- 
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minate  the  test  procedure  may  be  unnecessarily  large.  Furthermore,  in 
practice,  the  bound  may  not  be  known  exactly,  and  could  even  be 
assumed  smaller  than  it  actually  is.  Therefore,  to  make  the  problem 
more  realistic,  one  may  assume  that  the  presumed  bound  is  a  random 
variable  with  a  known  distribution  function.  Then,  clearly,  the  two  thres¬ 
holds  will  also  be  random  variables  and  the  solution  thus  obtained  will 
involve  non-zero  probabilities  of  false-alarm  and  miss. 

Now,  for  simplicity  of  discussion,  one  may  suppose  that  the 
presumed  bound  B  is  a  random  variable  uniformly  distributed  between 
[r— e,r+e],  where  r  is  the  actual  bound  of  the  noise  sequence  fn*}.  A 
realization  of  B  will  be  denoted  by  6 .  Then,  assuming  that  the  signal  is  a 
known  positive  constant,  the  two  thresholds  s-B  and  B  will  be  random 
variables  uniformly  distributed  between  [s-r-e.s-r  +e]  and  [r-e,r+e], 
respectively.  It  can  be  seen.  Fig.  4.B,  that  a  false-alarm  (type  I  error) 
occurs  when  b>r  and  x*>r.  Hence  the  probability  of  false-alarm  (false- 
alarm  rate)  is  given  by 

a(6)  =  Prob.  {6  >r,zi>r  j 

=  Prob.  {*j>r  1 6  >r  j  Prob.  [6>r  j  (7) 

Moreover,  a  miss  (type  II  error)  occurs  when  b  >r  and  zi<s-r.  Hence  the 
probability  of  miss  is 

1-0(6)  =  Prob.  (6>r,zi<s-r  j 

=  Prob.  (z^s—r  |6>rj  FVo6.  [6>rj  (8) 

Thus  the  power  0(6)  is  given  by 

0(6 )  =  1  —  Prob.  fz*  <s  — r  1 6  >r  j  Prob.  f 6  >r  J  (9) 

It  is  seen  in  Eq.  (7)  and  Eq.  (8)  that  both  the  false-alarm  rate  a  and 


A  False-alarm  occurs  when  b  >  r  and  x*  e  [  r  ,b  ] 
A  Miss  occurs  when  b  >r  and  xt  e  [  s  -6 ,  s  -r  ] 


Fig.  4.B  A  Sequential  Test  Procedure  with  Random 


Bounds  On  Noise  Magnitude 
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the  power  /9  are  functions  of  a  random  variable  B  and  thus  are  also  ran¬ 
dom  variables.  Thus  this  test  procedure  can  be  regarded  as  a  random 
sequential  procedure  as  its  thresholds  are  random  and  also  its  perfor¬ 
mance.  Properties  of  this  test  procedure  will  clearly  depend  on  the  sta¬ 
tistical  distributions  of  the  false-alarm  rate  and  the  power.  The  following 
sections  employ  some  statistical  assumptions  on  the  noise  sequence  and 
provide  clearer  insights. 

4. 3. 1  UnifoTmly  Distributed.  Random  Noise 

Suppose  that  for  any  presumed  bound  6 ,  the  noise  sequence  [ti*  ]  is 
uniformly  distributed  between  [—6,6].  Thus,  from  Eq.  (1),  under  H,  (x*{ 
is  uniformly  distributed  on  [-6,6]  and,  under  K,  [xt{  is  uniformly  distri¬ 
buted  on  [s— 6,s+6],  It  may  be  interesting  to  see  that  the  probability 
density  function  of  under  H  is  given  by 


Px{*) 


if  r  +e  Sr  x  >  r-e 

II 

*h 

sr 

-TPi 

4? 

if  r-c  x  >  -r+c 

= i  '-o 

if-r +e  x  >  -r—c 

=  0 

otherwise 

(10) 


Also  the  probability  density  function  of  A*  under  K  is  simply  a  shift 
to  the  right  of  Eq.  (10).  As  discussed  previously,  false-alarms  occur  when 
6>r  and  x,>r,  while  misses  occur  when  6>r  and  xi<s-r.  The  false- 
alarm  rate  and  the  power  can  then  be  evaluated  via  Eqs.  (7)  and  (9), 
respectively.  From  Eq.  (7), 


a(6)  =  Prob.  [x*>r  |6>r[  Prob.  [6>r{ 

-  —  ~r  1  -  1  r 
26  '  2  ”  4  ”  46 


< 


I 


4 


(11) 
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Similarly,  from  Eq.  (9), 

0(6 )  =  Prob.  fa  <s  —r  1 6  >r  j  Prob.  (6  >r  | 

=  h  w  (l2) 

Now  that  both  the  false-alarm  rate  and  the  power  are  functions  of  a 
random  variable  [Eqs.  (11)  and  (12)]  it  is  essential  to  consider  their  sta¬ 
tistical  properties;  e.g.  the  probability  distributions  and  the  statistical 
moments.  The  probability  density  function  of  a(6 )  is  given  by 


and  that  of  0(6 )  by 


2r 


—i 


e(l-4a)2 

0 


“  4(r-e) 
otherwise 


s  a  ^ 


4(r+e) 


(13) 


fB{P) 


Zr 


e(  1-40)2  4  4(r+e) 

=  0  otherwise 


4  4(r  -e) 


(14) 


Moreover,  the  first  two  statistical  moments  of  the  false-alarm  rate  and 
the  power  are  given  by 


where 


E[a(b)]=  jplnd+t) 


T 

W)]  =  J-+  ^(1+0 


(15) 


(16) 


and 


nar[a(6)]  =  uar[0(6)] 

=  mw  3k**(l+t)  -  <17> 

Eqs.  (13)  and  (14)  show  that  both  a  and  0  have  bounded  distributions  with 
an  algebraic  tail  behavior.  In  practice,  one  may  be  interested  in  cases 


-80- 

where  t«l,  then 

in(i+i)  = 

Thus 

£[«(*)]  "Jq 

E[f}(b)]*l-±i 

and 

Var[a(6)]  =  Vbr[/S(6)]  *  -^-t2 

Hence  if  f«l,  the  first-order  moments  of  both  a  and  /S  are  monotone 
linear  functions  of  t  while  the  variance  is  a  quadratic  function  of  t.  This 
result  suggests  a  reasonable  conclusion  that,  the  smaller  i  is,  the  smaller 
is  the  probability  of  error  involved  in  this  test  procedure.  Figs.  4.C-4.E 
depict  the  first  two  moments  of  a  and  /?  as  functions  of  t  as  t  ranges 
from  0.001  to  0.1  . 

4.3.2  Random  Noise  With.  Triangular  Distribution 

An  example  of  a  unimodal  and  bounded  distribution  is  the  triangular 
distribution  whose  density  function  is  given  by 

=  f(1_JT^  a  1*1  *  6 

fx(x)  =  o  otherwise 

Again,  ay  assuming  that  the  presumed  bound  B  is  uniformly  distributed 
between  [r-e,r+e],  one  may  evaluate  the  false-alarm  rate  and  the  power 
via  Eqs.  (7)  and  (9),  respectively. 

a(b)  =  Prob.  {zt>r  |6  >r} /Vob.  {6  >r  j 


< 


Now,  the  probability  density  functions  of  a(6)  and  /3(6)  are  given  by 


/a  («) 
and 


_  _1 _ r _ ,  J _ r _ 

4e  Va(l-2Va)2  4e  Va(l+2v'a)2 
=  0  otherwise 


if _ £! _ fee* _ £1 _ 

4(r-e)2  4(r+e)2 


_  1 


4e  VT^(1-2VT^)  4c  VT-/5(1+2VT^3) 


if  1  - 


4  (r  -c)2 


=  0  otherwise 

The  first  two  statistical  moments  of  a  and  /S  are  given  by 


*'*l-«* 


f  +  atirrr-  ■hln{M) 

tz 

if  t  «  1 
o 

and 


(10) 


£[«6)]  =  r_  5^^+  i(»(i+t) 

tz 

«l-7-  if  t  «  1 

o 

Vbr[a(6)]  =  Kar[/5(6)] 

+  UT  ~  et  (i +t)«  +  i6«(i+o 

W~f2  if  t  «  1 


(19) 

5 

32(l+f) 

_ 1_ 

2  96f  (1+f  ): 

(20) 


A  similar  set  of  curves  to  those  in  Figs.  4.C-4.E  describing  the  first  two 
moments  of  a  and  /S  as  functions  of  t  are  given  in  Figs.  4.F-4.H.  Here, 
instead  of  an  approximately  linear  relation,  an  approximately  quadratic 
relation  between  the  mean  and  t  is  shown.  This  relation  can  also  be  seen 
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Vfrr[a(6)]  =  Var[{3(b)] 


0.00015 

0.00014 

0.00013 

0.00012 

0.00011 

0.0001 

0.00009 

0.00008 

0.00007 

0.00006 

0.00C05 

0.00004 

0.00003 

0.00002 

0.00001 


0.1 


■i 


■i 


4 


Fig.  4  H  Variance  of  a  and  p 
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from  Eqs.  (18)  and  (19).  The  difference  certainly  results  from  the 
different  assumptions  on  the  noise  distributions  employed.  However,  as 
cam  be  seen  from  Figs.  4.F-4.H,  the  effect  of  inaccuracy  of  the  presumed 
bound  on  the  noise  on  the  probability  of  error  is  less  than  that  in  the  pre¬ 
vious  case.  This  is  due  to  a  smaller  tail  probability  of  the  noise  distribu¬ 
tion  used  here. 


4.3.3  Random  Noise  with.  Truncated  Gaussian  Distribution 

In  the  last  two  sections,  the  noise  sequence  was  assumed  to  have  a 
bounded  distribution,  i.e.  uniform  distribution  and  triangular  distribu¬ 
tion,  with  a  random  bound  which  has  a  uniform  distribution.  However, 
these  distributions  may  not  be  of  sufficient  practical  interest.  In  applica¬ 
tions  such  as  radar  or  sonar,  the  noise  density  function  often  exhibits  a 
Gaussian-shaped  central  part,  although  the  magnitude  of  the  noise  may 
be  bounded  by  some  equipment  limitations.  The  probability  density  func¬ 
tion  of  a  random  variable  with  a  truncated  Gaussian  distribution  is  given 
by 


fx(x ) 


=  -£l^-exp_**/2ff*  if  jx|;s6 


V2tt(7 
=  0 


otherwise 


(21) 


where  C(6)  is  a  normalization  constant  and  is  given  by 


C(6  )=[*(£)-*<  ^4]-’ 

a  o 

As  in  the  previous  sections,  the  bound  6  is  assumed  to  be  a  random  vari¬ 
able  uniformly  distributed  between  [r-e.r+e],  Then,  by  Eqs.  (7)  and  (9), 
the  false-alarm  rate  and  power  are  given  by 
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a(6 )  =  Prob.  fs*  >r  ,6  >r  { 

=  -4(^1 

C  a  a 

and 

n(  6)  =  i 

2  O  O 

Because  of  the  functional  forms  involved  in  the  density  function,  Eq.  (21), 
the  statistical  moments  of  a  and  0  are  hardly  expressible  in  a  simple 
form.  However,  curves  depicting  them  as  functions  of  t  can  be  obtained 
by  using  numerical  methods  as  shown  in  Figs.  4.I-4.K.  Again,  these  curves 
are  obtained  for  t  varying  from  0.001  to  0.1  .  Also,  the  value  of  r  is  set  at 
4a  .  as  the  tail  probability  of  a  Gaussian  distribution  beyond  this  point  is 
sufficiently  small.  A  smooth  monotone  nondecreasing  relation  between  t 
and  the  first  two  moments  of  the  probability  of  error  can  be  seen  from 
these  curves.  This 'is  compatible  with  the  intuition  that  the  more  accu¬ 
rate  the  presumed  b^.  ;nd  is,  the  less  probability  of  error  will  result  from 
this  test  procedure. 

To  investigate  in  more  detail  the  sequential  test  procedure  here,  one 
may  be  interested  in  comparing  its  performance  to  that  of  the  standard 
SPRT.  It  is  obvious  that  both  error  probabilities  converge  to  zero  as  the 
bound  on  the  noise  distribution  goes  to  infinity,  namely 

lim  a(6)  =  0  almost  surely 

r-»«» 

and 

lim  0(6)  =  1  almost  surely 

Unfortunately,  the  ASN  will  be  infinite  in  the  limit.  A  similar  result  can 
be  expected  from  the  standard  SPRT.  Thus,  to  compare  these  two  test 
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procedures,  one  has  to  consider  rate  of  convergence  of  the  error  proba¬ 
bilities.  In  [12],  it  is  shown  that  the  ASN  of  the  SPRT  under  H  and  under 
K  are  given  respectively  by 

„  /„x  _  (1  ”  «)  losT’i  +  «  1087*2 

EhW - B&. 5 - 

and 

r  _  0 logT* i  +  (1  -0)log Tz 
K  Efc{z) 

where  z  is  the  test  statistic  and  T\  and  Tz  are  two  thresholds  given  by 

7’1  =  0/(l-a)  and  7*8  =(1-0) /a 
Thus  one  can  see  that  for  the  same  a  and  the  same  0,  the  ASN  of  the 

SPRT  is  less  than  that  of  the  sequential  procedure  here.  On  the  other 
hand,  a  complete  knowledge  on  the  noise  density  function  required  by  the 
SPRT  and  the  computational  simplicity  involved  in  the  proposed  sequen¬ 
tial  procedure  should  also  be  taken  into  account  in  making  comparisons. 

4.3.4  Random.  Noise  with  Truncated  Laplace  Distribution 

Another  distribution  which  could  be  of  practical  interest  is  the 
Laplace  distribution.  As  in  the  previous  section,  the  truncated  version  of 
the  Laplace  distribution  will  be  considered  here  with  probability  density 
function  given  by 

_  AC^Lexp-*|»|  jj  |z|£b 
=  0  otherwise 

where  C(6)=(l-e~M)~1  is  a  normalization  constant.  Again,  by  assuming 
that  the  bound  6  is  a  uniformly  distributed  random  variable  on 
[r-c.r+e],  one  may  evaluate  the  false-alarm  rate  and  the  power  via  Eqs. 


*  1 

"  i 


(7)  and  (9), 


a(6)  =  Prob.  fzt>r,6>r} 

and 

0(6)  =  assuming  that  s  >r 

4 

Again,  in  order  to  achieve  some  insights  as  to  the  statistical  properties  of 
a  and  0,  one  has  to  employ  numerical  techniques.  Figs.  4.K-4.N.  shows 
the  first  two  moments  of  a  and  0  as  functions  of  t.  A  smooth  monotone 
nondecreasing  relation  between  t  and  the  probability  is  also  shown  here. 
However,  comparing  to  the  results  in  the  previous  section,  greater  proba¬ 
bility  of  error  is  obtained  here  due  to  the  heavier  tail  probability  in  the 
Laplace  distribution. 

4.4  Duality  Between  Hypothesis  Testings  and  Confidence  Regions 

In  the  previous  section,  detection  of  a  known  signal  in  noise  has  been 
formulated  as  a  canonical  hypothesis  testing  problem,  namely, 

# :  *i  =  n< 

i=l,2,... 

K:  *i  =  n*  +  s,  s>0 

This  is  equivalent  to  testing  H\  t?=a<o  vs  K\  d=/io+s>  *here  j  and 

However,  if  the  signal  strength  is  unknown,  this  is  equivalent 
to  testing  H :  vs  K:  i.e.  any  value  of  d  other  then  n o  is  a  possi¬ 

ble  alternative.  One  can  then  consider  this  problem  as  one  of  estimating 
the  mean  of  the  observation  data  sequence,  'j  a  matter  of  fact,  there 
exists  a  complete  duality  between  families  of  level  a  tests  and  level  (l-o) 
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confidence  regions.  To  explain  this  concept  in  more  detail,  let  us  con¬ 
sider  the  following  example  [14]:  Suppose  that,  on  the  basis  of  m  obser¬ 
vations,  Xj,X2,...,zm,  one  wants  to  decide,  with  the  assumption  that  the 
Xj's  are  i.i.d.,  whether  or  not  the  population  mean  of  [AiJ  is  Ho-  Then  a 
size-a  test  can  be  based  on  a  level  (1  -  a)  confidence  interval  as  follows: 
Accept  H.  if  and  only  if,  the  postulated  value  hq  is  a  member  of  the  level 
(1-a)  confidence  interval  [X  -  a,X  +  a]  where  a  is  a  function  of  (1  -  a) 
and  fiQ.  Thus,  if  a  likelihood  ratio  test  is  considered,  by  properly  choosing 
the  threshold  T0,  one  can  associate  the  test  with  the  confidence  interval. 
However,  since  the  same  interval  is  used  for  every  Hq,  there  actually  is 
generated  a  family  of  level  a  tests  with  parameter  0.  Conversely,  fami¬ 
lies  of  tests  can  generate  confidence  bounds  and  intervals.  Let  fdtX.tf)} 
be  a  family  of  tests  such  that  d(A',/x0)  is  a  test  of  level  of  significance  a  for 
testing  H:  -  Mo  lor  each  ^€0  where  0c R  and  where  «5(A  ,t?)  denotes 
the  critical  function  of  a  test.  For  fixed  x,  define  the  subset  C(x)  of  0  by 

C(x)  =  f-tf:d(x,i»  =  0{ 

This  is  just  the  set  of  all  0  that  would  be  accepted  if  A=x  is  observed  and 
the  given  family  of  tests  is  used.  Suppose  that  C(x)  is  of  the  form 
<a(x),°o)  n  0  for  each  x,  where  <  indicates  that  the  point  o(x)  may  or 
may  not  be  in  the  indicated  ray.  Then,  a(x)  is  a  lower  confidence  bound 
of  level  (1  -  a)  for  .  This  is  true  since 

Prob.  [a (A)  £  Prob.  e  C(x)}  =  Prob.  =  Oj 

=  1  -  Prob.  fd(x,i9)  =  1}  St  1  -  a 

Similarly,  if  C(x)  is  of  the  form  <o(x),6(x)>  n  0  for  each  x,  then 
[a(x),6(x)]  is  a  level  (1  -  a)  confidence  interval  for  tf.  Also,  if  C(x)  is  of 
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the  form  (-»,6(x)>  n  0  for  each  x,  then  6(z)  is  an  upper  confidence 

• 

bound  of  level  (1  -  a)  for  i?.  In  fact,  both  confidence  bounds  and  intervals 
can  be  considered  as  random  subsets  of  the  parameter  space.  The  true 
parameter  value  is  included  in  these  subsets  with  probability  at  least 
(1-a). 

The  above  discussion  demonstrated  a  complete  duality  between 
confidence  intervals  and  hypothesis  testings  and  also  provides  the 
motivation  for  casting  the  hypothesis  testing  problem  as  an  estimation 
problem. 

4.5  Detection  of  Unknown  Constant  Signals  in  Bounded  Noise 

The  problem  of  detecting  known  constant  signals  in  bounded  noise 
has  been  discussed  in  the  previous  parts  of  this  chapter.  When  the  bound 
on  the  noise  quantity  is  a  known  constant,  a  sequential  procedure  which 
leads  to  a  trivial  solution  to  the  detection  problem  was  considered.  On 
the  other  hand,  when  the  bound  is  a  random  variable,  this  sequential  pro¬ 
cedure  becomes  a  random  test.  By  imposing  some  statistical  assump¬ 
tion,  e.g.  known  distribution,  on  the  bound  and  also  on  the  noise,  one  can 
then  describe  some  statistical  properties  of  this  random  test.  In  particu¬ 
lar.  the  first  two  moments,  or  even  the  probability  density  functions,  of 
the  false-alarm  rate  and  the  power  can  be  obtained. 

In  this  section,  a  similar  problem  will  be  considered.  However,  the 
signal  strength  will  be  assumed  to  be  unknown.  To  solve  this  detection 
problem,  one  may  consider  either  of  the  following  two  approaches:  (1) 
Formulate  performance  of  the  test  procedure  as  a  function  of  the  signal 


strength  and  then  optimize,  in  some  sense,  the  performance  with  respect 
to  the  signal  strength.  (2)  Cast  the  detection  problem  as  an  estimation 
problem.  In  the  last  section,  duality  between  hypothesis  testings  and 
confidence  regions  has  been  discussed.  It  was  shown  that  the  detection 
of  an  unknown  constant  signal  can  be  considered  as  an  estimation  of  the 
mean  of  the  observation  data. 

Now  the  hypothesis  testing  problem  given  in  Eq.  (1)  is  considered, 
where  s  is  unknown  and  n^Bz,  for  any  i.  As  discussed  in  Section  4.4, 
this  problem  can  be  formulated  as  follows: 

//,:«  =  0 
0 

where  1I=E\Xi\  and,  of  course,  we  have  assumed  that  £’|A^{=0  for  any  i. 
One  of  the  easiest  ways  to  estimate  the  mean  of  the  observation  data  is  to 

consider  the  sample  mean,  £  ^»-  However,  we  now  proceed 

rather  differently  as  follows: 

Let  Gj=far:  and  take  intersections  of 

Cg's  for  1,2,...,  and  define 

in  -  n  Q 

i-l 

It  is  clear  that  the  true  value  of  the  mean  of  the  Xj's  should  lie  inside  Iu, 
for  each  M.  In  this  section,  we  consider  the  estimate  of  the  mean  to  be  a 
set  rather  than  a  single  point.  Thus  every  point  in  /jf  is  ah  estimate  here. 
If  a  set  function  nu  is  defined  for  the  measure  of  In  as  follows: 

Mjf  =  I(/j t)  -  length  of  In, 

then,  obviously,  fin  is  a  monotone  nonincreasing  function  of  M.  Now,  if 
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we  assume  that  Gi-Gj for  any  t *j ,  then 

jim  Mjf=  0, 

and  thus  the  true  value  of  the  mean  evolves  in  the  limit.  Therefore, 
based  on  these  arguments,  one  can  solve  the  detection  problem  as  fol¬ 
lows: 

1.  If  xfeB2,  stop  and  decide  K 

2.  If  xf^B2,  obtain  Q  and/* . 

3.  When  stop.  If  decide  H,  otherwise  decide  K.  Here  t 

is  a  small  constant  which  is  properly  chosen  so  that  the  sequen¬ 
tial  procedure  can  be  truncated  at  some  finite  number  of  sam¬ 
ples. 

The  only  possibility  of  making  wrong  decisions  in  this  procedure  is  to 
decide  H  while  K  is  true,  i.e.,  the  false-alarm.  The  signal  strength  and 
the  choice  of  e  clearly  affect  this  possibility.  Again,  the  assumption  of 
constant  bound  on  the  noise  quantity  leads  to  a  singular  solution. 
Another  reason  for  this  result  is  that  the  sequential  procedure  discussed 
here  does  not  utilize  any  statistical  information,  which  in  some  practical 
situations  may  not  be  easy  to  obtain,  and  thus  does  not  involve  any  sta¬ 
tistical  procedure.  One  disadvantage  associated  with  this  sequential  pro¬ 
cedure  is  that  the  number  of  samples  required  to  terminate  the  test  may 
be  too  large.  However,  an  obvious  advantage  here  is  the  computational 
simplicity  involved  in  implementing  this  procedure. 


4.6  Conclusions 


In  this  chapter,  detection  of  a  constant  signal  in  bounded  noise  has 
been  considered.  Both  the  known  signal  and  unknown  signal  cases  are 
addressed.  In  the  former,  if  the  bound  is  a  known  constant,  the  solution 
is  singular,  namely,  both  the  false-alarm  rate  and  the  probability  of  miss 
are  zero.  To  make  the  problem  more  realistic,  we  considered  noise  with  a 
random  bound;  then  a  random  test  is  obtained.  Some  statistical  proper¬ 
ties  of  the  performance  parameters,  e.g.  the  false-alarm  rate  and  the 
power,  are  obtained  for  several  examples  with  presumed  assumptions  on 
the  distributions  of  the  noise  and  the  bound.  As  can  be  expected,  the 
probability  of  error  of  the  test  is  dependent  on  the  tail  property  of  the 
noise  distribution  and,  of  course,  the  accuracy  of  the  presumed  bound  on 
the  noise  magnitude.  In  contrast  to  the  SPRT,  the  test  procedure  dis¬ 
cussed  in  this  chapter  requires  very  little  statistical  information  on  the 
noise  and  involves  less  computational  complexity,  although  it  may  be  ter¬ 
minated  with  a  larger  sample  size.  Finally,  detection  of  an  unknown  sig¬ 
nal  in  bounded  noise  is  considered  as  an  estimation  problem.  The  test 
procedure  discussed  in  Section  4.5  is  rather  different  from  the  usual 
approaches  in  the  sense  that,  at  each  time,  a  set  of  estimates  instead  of  a 
single  estimate  is  obtained.  Unfortunately,  performance  evaluation  of 
this  procedure  is  not  clear.  More  complete  results  may  be  obtaihed  in 
the  future. 
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A  SET-THEORETIC  APPROACH  TO  DETECTION  PROBLEMS 


5.1  Introduction 

In  the  literature,  detection  problems  have  almost  always  been  dis¬ 
cussed  by  the  formulations  of  statistical  inference.  Statistical  models 
are  used  to  describe  the  noise  or  the  noise  and  the  signal  These  formula¬ 
tions  usually  require  some  presumed  statistical  information  which  may 
not  be  available  precisely.  This  chapter  considers  the  detection  problem 
from  a  rather  different  point  of  view.  Set-theoretic  formulations,  instead 
of  statistical  formulations,  are  employed  to  describe  the  input  model. 
This  approach  may  be  useful  in  detecting  the  failure  of  dynamical  sys¬ 
tems.  where  problems  usually  occur  when  the  components  of  the  system 
deviate  from  the  nominal  ones  [l]  -  [3]. 

The  problem  addressed  here  can  be  considered  as  a  generalization  of 
the  one  discussed  in  Section  4.5  of  the  last  chapter,  where  detection  of  an 
unknown  signal  in  bounded  noise  is  considered.  It  is  assumed  here  that 
the  signal  is  an  unknown  constant  vector.  The  noise  sequence  is  con¬ 
strained  by  a  compact  set  in  R* .  The  detection  problem  is  solved  by 
estimating  the  signal  first.  Then,  based  on  the  estimate  of  the  signal, 
decisions  are  made  as  to  whether  or  not  the  signal  is  present.  The  pro¬ 
cedure  can  be  explained  briefly  as  follows:  at  each  observation,  a  set 
which  is  compatible  with  the  constraint  and  the  observation  data  is 
obtained.  The  estimate  of  the  signal  based  on  each  single  observation  is 
thus  a  set  rather  than  a  single  element  in  the  k  -dimensional  vector 
space.  When  multiple  observations  are  taken,  intersection  of  these  sets 


is  considered  as  the  set  of  estimates.  Then,  under  some  conditions,  this 
set  will  eventually  converge  to  a  point  which  is  the  signed  to  be  estimated. 
The  decision  of  whether  a  signal  is  absent  clearly  depends  on  whether  this 
point  is  located  at  the  origin. 

5.2  A  Detection  Problem 

Given  an  observation  sequence  (xj,  one  is  required  to  decide,  subject 
to  some  constraint,  whether  or  not  an  unknown  constant  vector-signal  is 
present.  This  problem  can  be  formulated  as  the  usual  hypothesis  testing 
problem  as  follows: 

H  :Xi=Ui 

K  :Xi  -  s  +  n*  i  =  l,  2,  3, .  (1) 

where  x.  s,  and  n  are  k  -vectors.  Also  n  is  a  random  sequence  constrained 

by  the  following  relation 

n <eS<  (2) 

where  5*  is  a  compact  set  in  Rk .  In  accordance  with  the  discussion  in 

Sections  4.4  and  4.5  of  the  previous  chapter,  this  detection  problem  will 
be  cast  as  an  estimation  problem  here.  The  estimates  of  the  signal  dis¬ 
cussed  here  will  be  a  set  rather  than  a  single  vector  in  the  A -dimensional 
vector  space.  Thus  the  solution  of  the  detection  problem  of  Eq.(l)  is 
equivalent  to  finding  a  set  0*  in  Rk  with  Hi  being  compatible  with  Eqs.(l) 
and  (2).  Note  that  every  element  in  is  an  estimate  of  the  signal  a. 

To  find  the  set  0*  is  conceptually  straightforward.  From  each  obser¬ 
vation,  a  set  compatible  with  Eqs.(l)  and  (2)  is  obtained.  Every  element 
in  this  set  is  thus  an  estimate  based  on  this  observation  After  i 
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observations  are  made,  the  set  of  estimates  is  simply  the  intersection  of 
these  sets  obtained  at  each  observation.  Hence  the  set  Hi  can  be 
expressed  as 

Obviously,  the  sequence  of  sets  j^i  is  a  monotone  non-increasing 
sequence,  namely 

fli  a  Clz  a  Os  a . 

Under  some  specific  assumptions,  this  sequence  will  converge  to  a  point. 
The  decision  of  whether  or  not  the  signal  is  present  then  depends  on 
where  this  point  is  located  in  the  /?* .  Similar  to  the  results  obtained  in 
the  previous  chapter,  the  solution  to  this  detection  problem  is  singular 
due  to  the  set-theoretic  assumptions  used  here. 

Unfortunately,  in  general  formulations  of  the  sets  fl*  are  not  compu¬ 
tationally  simple.  Suppose  that,  for  example,  the  constraint  set  S*  is  an 
intersection  of  two  half  spaces.  Then  at  each  time  instant,  the  set  0* 
defined  in  Eq.(3)  is  a  polytope.  Formulation  of  a  polytope  is  usually  a 
numerically  complicated  task.  One  possibility  to  avoid  this  difficulty  is  to 
define  a  set  which  can  be  more  easily  formulated  and  which  bounds  0, 
"tightly"  (of  course,  the  word  "tightly"  has  to  be  precisely  defined).  A 
good  candidate  for  the  bounding  set  in  this  case  will  be  a  k  -dimensional 
ellipsoid  which  can  be  formulated  in  terms  of  quadratic  equations.  Note, 
however,  that  when  the  bounding  set  is  used  to  proceed  with  the  estima¬ 
tion  procedure,  there  may  exit  a  non-empty  subset  of  this  bounding  set 
that  contains  elements  which  are  not  compatible  with  the  constraints  and 
the  observed  data.  Thus  it  is  no  longer  true  that  every  element  of  the  set 
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is  an  estimate.  Furthermore,  when  only  a  finite  number  of  observations 
is  used,  decisions  based  on  this  set  of  estimate  may  not  be  correct. 
Hence  false-alarms  and  misses  will  occur. 


5.3  Detection  of  Constant  Signals  in  Bounded  k-Variate  Noose 

The  detection  problem  given  in  Eq.(l)  is  considered  here  again.  Now, 
the  noise  sequence  is  subject  to  the  following  constraint 


nf  n*  *  1  v  i 

Under  H,  Eq.  (4)  implies  that 

(4) 

*  1 

Under  K,  it  becomes 

(5) 

s  1 

After  some  algebraic  manipulations,  £4  (6)  becomes 

(6) 

(»-*)*! 

where 

(7) 

■e  =  Pi*i  Pi  =  4xfc 

The  Eq.  (7)  stands  for  a  k -dimensional  ellipsoid  centered  at  s^.  In 
fact,  it  is  a  4: -dimensional  spheroid  since  P*  is  an  identity  matrix.  If  one 
takes  multiple  observations,  each  observation  would  result  in  a  k- 
dimensional  ellipsoid  as  given  by  Eq.  (7).  Thus,  after  taking  M  observa¬ 
tions,  the  set  of  estimates  of  the  signal  is  the  intersection  of  all  these 
ellipsoids,  and  thfe  geometric  center  should  be  a  good  estimate  of  s,  when 
a  single  estimate  is  required.  However,  the  intersection  of  ellipsoids  is 
very  difficult  to  formulate;  hence  its  geometric  center  becomes  a  fuzzy 
concept.  A  possible  way  to  get  around  this  problem  is  to  consider  an 
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ellipsoid  which  "tightly"  bounds  the  intersection.  A  "tightly  bounding 
ellipsoid”  is  defined  as  follows  [4] 

Definition  :  An  ellipsoid  E  which  bounds  a  set  5  is  said  to  be  tight  if 
E  2  E'  35  implies  that  E'  =  E,  for  any  subset  E'  of  E. 

Nevertheless,  this  definition  only  defines  a  "tightly"  bounding  ellipsoid  in 
a  certain  direction.  There  are  infinitely  many  "tightly"  bounding  ellip¬ 
soids.  In  order  to  obtain  an  efficient  procedure,  these  ellipsoids  should 
be  chosen  in  some  optimal  way. 

Let  \Eii  be  e  sequence  of  optimal  bounding  ellipsoids  and  Si  be  the 
sets  specified  by  Eq.  (6),  then, 

=  ft :  (s  -  «c  (<-l))r  Pf-\  (s  -  ic  (i-1))  (B) 

and 

Si  =  (s :  -  u)T  (*  -  ■)  i  1 1 

An  ellipsoid  which  "tightly"  bounds  the  intersection  of  Ex  and  S*  is  formu¬ 
lated  as  follows 

E  =  f  s  :  (s  -  *)r  PiL\  (s  -  *)  +  ^  (*  -  s)r  (**  -  s)  i  1  +  pi  | 
where  0  i  p<  <  ».  After  some  algebraic  manipulations,  one  can  show  that 

=  ?”(■-*  )**l  (9) 

where 

Q~*  ~  P\- \  *  Pi 
•c  =  Q  [  Pi-\  Mi-O  +  Pi  *  3 
and 

$  =  l  +Pi  —  (Mi— 1)  “  *i)T  G  (^(i-1)  -*») 

Clearly,  the  parameter  p*  determines  the  orientation  (and  even  the 
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size)  of  the  bounding  ellipsoid.  Finding  a  sequence  of  optimal  bounding 
ellipsoids  [Ei\  is  equivalent  to  finding  a  sequence  of  optimal  (p<|.  One 
can,  for  example,  consider  an  ellipsoid  with  minimum  volume  as  an 
optimal  one.  It  is  known  that  the  volume  of  an  ellipsoid  formulated  by  Eq. 
(9)  is  linearly  proportional  to  the  determinant  of  the  matrix  Q~l/6.  Thus 
minimizing  the  determinant  with  respect  to  p*  will  result  in  an  optimal 
bounding  ellipsoid.  The  problem  of  finding  optimal  bounding  ellipsoids 
with  minimum  volumes  has  been  examined  in  the  literature  [5]  and  thus 
will  not  be  repeated  here.  It  should  be  noted  here  that  both  £*_i  and  5< 
contain  the  true  signal  vector.  Therefore  the  intersection  of  them  is  not 
an  empty  set.  Furthermore,  it  can  be  shown  that  6  is  a  positive  real 
number;  thus  the  ellipsoid  described  by  Eq.  (9)  is  non-degenerate.  Fig. 
5.A  shows  the  bounding  ellipsoid  which  contains  the  intersection  of  the 
observation  set  and  the  previous  ellipsoid. 

Now,  a  sequential  algorithm  for  solving  the  detection  problem  of  Eq. 
(1)  can  be  developed  in  the  following: 

(i)  Start  with  a  very  large  spheroid  centered  at  the  origin,  namely 

£*0  =  f  * •  sr  Pqx  t  *  i  J 

where 

Pol  -o  In 

and  a  is  a  very  large  real  number. 

(ii)  Successively  take  observations,  compute  xf  z*  for  every  i.  If  one 
ever  obtains 


Xj  >  1 


for  any  i 


-  'N. 
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Fig-  5. A  An  Illustration  ol  The  Bounding  Ellipsoid  Et 
and  The  Observation  Set  St 
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decide  K.  Otherwise,  calculate  an  optimal  bounding  ellipsoid 
based  on  the  current  observation  and  the  previous  ellipsoid. 

(iii)  As  the  number  of  observation  increases,  se(i)  should  become 
closer  and  closer  to  s.  Therefore,  decide  H  if  se(i)  -*  0,  other¬ 
wise,  decide  K. 

However,  use  of  the  decision  criteria,  Step  (iii),  may  require  an  infinite 
number  of  samples.  In  reality,  the  procedure  has  to  be  stopped  at  some 
finite  number  of  samples.  To  make  the  algorithm  more  practical,  one 
may  choose  a  neighborhood  of  the  origin  0  such  that  if  ^(i)  e  O  and  if 
flj  3  |0j,  H  will  be  accepted. 

5.4  Performance  Evaluation 

Since  the  sequential  estimation-detection  algorithm  discussed  in  this 
chapter  is  significantly  different  from  the  conventional  ones,  its  perfor¬ 
mance  will  also  be  evaluated  from  a  rather  different  point  of  view.  Notice 
that  the  set  A  defined  in  Eq.  (8)  is  different  from  the  set  of  estimates  A 
as  specified  in  Eq.  (3).  As  a  matter  of  fact,  it  is  usually  true  that  the 
difference  set  of  A  and  A  is  a  non-empty  set.  Let  A  ^  A  -  A .  then  the 
elements  of  A  are  not  estimates  that  are  compatible  with  the  constraint 
set  and  the  observation  data.  To  evaluate  the  performance  of  this  detec¬ 
tion  algorithm,  we  proceed  as  follows  : 

Let  Mp(i)  and  tig(i)  be  the  set  functions  for  the  measures  of  A  *nd 
A  respectively.  Obviously, 

lim  ~  0  and  lim  =  0 
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Based  on  the  sequential  algorithm  here,  the  false-alarm  occurs  when 
Ei  d  (Of  but  ^(i)  e  O  at  the  stop.  Also,  the  miss  occurs  when  £*  3  (Of 
and  ^(i)  £  0  at  the  stop.  To  be  more  specific,  one  should  define  a  dis¬ 
tance  measure  d(sti%)  (which  may  be  the  Euclidean  norm)  of  two  vectors 
S]  and  «2  on  the  space  Rk .  Then  a  neighborhood  of  the  origin  0  is  defined 
by  properly  choosing  a  small  positive  real  number  t  such  that 

0  =  (a  :  d(s,0)  <  e  j 

The  decision  criteria  now  is  re-stated  below 

(a)  If  >  1  for  some  i,  decide  K. 

(b)  If  Ei  *  (Oj.  decide  K. 

(c)  If  Ei  D  (Of  while  d(Se  (i),0)  >  c  ,  decide  K.  On  the  other  hand,  if 
Ex  o  (Of  and  d( s^(i).O)  <  e,  decide  H. 

5.5  Concluding  Remarln 

A  detection  problem  using  no  statistical  assumptions  on  the  input 
model  has  been  discussed  in  this  chapter.  The  only  assumption  imposed 
here  is  that  the  noise  is  constrained  by  a  compact  set  in  Rk .  Due  to  this 
set-theoretic  assumption,  the  solution  thus  obtained  is  rather  different. 
The  detection  scheme  devised  here  is  essentially  a  combination  of  esti¬ 
mation  and  detection  with  a  sequential  nature.  However,  since,  it  is 
required  to  estimate  the  signal,  the  discrimination  between  H,  the 
hypothesis,  and  K,  the  alternative,  may  be  insignificant  when  a  weak  sig¬ 
nal  is  encountered. 

As  for  evaluating  the  performance  of  this  scheme,  set  functions  are 


introduced.  Unfortunately,  due  to  the  computational  difficulties  involved 
in  formulating  intersections  of  compact  sets,  the  result  presented  here  is 
somewhat  preliminary.  Finally,  it  should  be  stressed  here  that  finding  a 
sequence  of  optimal  bounding  ellipsoids,  Section  5.3,  yields  a  facility  for 
evaluating  the  information,  contained  in  each  observation  set,  which  is 
pertinent  to  the  updating  of  the  estimates.  This  reduces  the  computa¬ 
tional  complexity  involved  in  the  estimation  procedure. 
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CONCLUSIONS 


6. 1  Summary  of  Results 

The  method  of  series  expansions  is  employed  in  this  report  to 
obtain  approximations  to  the  test  statistics  of  some  optimal  detectors. 
The  Edgeworth  series  is  used  to  devise  an  approximate  locally  optimal 
detection  scheme.  Convergence  properties  are  addressed  here.  Under  a 
certain  condition,  the  Tauberian  condition,  this  approximation  converges 
to  the  locally  optimal  statistic.  However,  it  is  shown  that,  even  if  this 
condition  is  not  satisfied,  the  approximate  scheme  may  still  provide  "rea¬ 
sonably  good"  performance,  e.g.  the  ARE  being  greater  than  unity  as 
compared  to  the  linear  detector.  The  other  achievement  of  this  study 
in  the  application  of  series  expansions  is  an  investigation  of  the 
effect  of  the  noise  skewness  on  the  detector  performance.  Asymptotic 
relations  between  the  performance  and  the  noise  skewness  are  obtained 
for  both  the  sample  mean  detector  and  the  sign  detector.  The  sign 
detector  is  shown  to  be  less  sensitive  than  the  sample  mean  detector  to 
the  noise  skewness,  an  intuitively  reasonable  result.  Furthermore,  a 
modification  of  the  sample  mean  detector  is  proposed.  Performance  of 
this  modified  scheme  in  skewed  noise  is  asymptotically  equivalent  to  that 
of  the  sample  mean  detector  in  Gaussian  noise. 

The  second  part  of  this  report  is  concerned  with  signal  detec¬ 
tion  in  bounded  noise.  When  the  signal  is  assumed  to  be  a  known  con¬ 
stant,  a  sequential  solution  is  obtained.  By  assuming  that  the  bound  is  a 
random  variable,  a  random  test  results.  Evaluation  of  this  test  thus 
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requires  the  statistical  moments  of  the  false-alarm  rate  and  power.  On  j 

the  other  hand,  if  the  signal  is  an  unknown  constant,  an  estimation- 

detection  procedure  is  proposed.  Unfortunately,  performance  evaluation 

of  the  procedure  is  still  an  unresolved  question.  Finally,  a  set-theoretic 

formulation  of  a  vector-signal  detection  problem  is  discussed.  Again, 

estimates  of  the  signal  are  obtained  to  make  decisions  as  to  whether  or 

not  the  signal  is  present.  Since  no  statistical  assumptions  are  imposed  : 

here,  performance  can  not  be  evaluated  conventionally. 

6.2  Future  Research 

The  problem  of  modifying  the  sample  mean  detector  (or  the  linear 
detector)  using  methods  of  series  expansions  has  been  addressed  here. 

More  sophisticated  series  expansions  may  be  considered  to  achieve 

*■;  .  < 

better  approximations  and  convergence  results.  As  for  examining  the 
noise  skewness  on  detector  performance,  the  analysis  here  is  based  on 
large  sample  size  and  small  skewness  assumptions.  Other  methods  may 
be  used  to  remove  these  two  assumptions. 

The  performance  evaluation  for  signal  detection  in  bounded  noise  is 
not  yet  well-resolved.  Furthermore,  the  number  of  samples  required  to  ’ 

terminate  the  sequential  procedure  may  be  large.  Some  revision  of  this 
scheme  will  be  an  interesting  problem.  Finally,  it  is  the  author’s  belief 
that  the  set-theoretic  formulations  discussed  in  Chapter  5  deserves  more 
consideration.  Failure  detection  of  dynamical  systems  may  be  one  of  the 
applications. 
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